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INTRODUCTION 


No  exact  date  can  be  designated  as  marking  the  birth  of  organizational 
development.  Perhaps  the  late  1950's  or  early  1960's  marked  the  first  use 
of  the  specific  term.  What  has  taken  place,  therefore,  has  occurred 
within  the  last  15  years,  years  which  have  seen  a substantial  investment 
in  the  range  of  activities  loosely  representing  this  applied  field. 

Although  we  lack  exact  dollar  counts,  a plausible  estimate  of  the  total 
funds  invested  in  organizati onal  development  must  run  to  hundreds  of 
millions  of  dollars.  By  any  standard,  this  is  a large  amount,  one  that 
no  entity  — whether  it  be  public  or  private  --  may  take  lightly. 

This  same  time  period  represents  as  well  the  first  point  at  which 
it  was  conceptualized  as  organi zational  development,  as  opposed  to  manage- 
ment development  or  simply  training.  No  exact  definition  has  general 
currency,  but  the  term  is  generally  taken  to  refer  collectively  to  an 
assortment  of  training  or  therapeutic  interventions  whose  aim  is  presumed 
to  be  improvement  of  the  organization  and  its  members. 

However  it  is  operationally  defined,  the  problem  of  organizational 
development  and  change  would  appear  to  contain  two  component  subprocesses, 
diagnosis  and  therapeutic  intervention  (Bowers  & Franklin,  1977). 

Although  equally  crucial  to  the  success  of  any  development  effort,  diagnosis 
takes  prior  importance  for  the  simple  reason  that  it  occurs  earlier  in  the 
flow  of  developmental  events,  Thus,  an  important  part  of  the  consultant's 
role  is  often  presumed  to  consist  of  translating  a wide  variety  of  symptoms 
into  a coherent  pattern  that  permits  planning  and  carrying  out  appropriate 
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remedial  action.  According  to  Lawrence  & Lorsch  (1969),  the  reasons  for 
the  importance  of  diagnosis  in  organizational  development  are  many  and 
persuasive:  *• 

(1)  The  client  system  may  not  be  aware  of  the  problem  at  all. 

For  example,  the  difference  between  present  effectiveness 
levels  and  unanti ci pated  opnortuni ties , rather  than  obvious 
difficulties,  may  be  the  "problem." 

(2)  The  client  system  may  not  be  aware  of  the  real  problem. 

(3)  A discrepancy  between  actual  and  desired  outcomes  does  not 
explain  and  account  for  itself. 

(4)  Problem  variance  is  likely  to  be  multiply  caused. 

(5)  Causes  are  likely  to  interrelate  in  complex  ways. 

(6)  Causes  are  likely  to  differ  greatly  in  potency,  and  what 
is  desired  is  a designation  of  variables  with  leverage. 

(7)  Meaning  can  only  be  given  to  causal  information  by  casting 
it  into  an  appropriate  configuration  against  a set  of 
principles. 

(8)  What  is  required  for  action  planning  is  an  overall  and 
integrated  view,  not  a parochial  one. 

(9)  Diagnosis,  if  done  we  11,  provides  some  insurance  against 
rushing  into  an  inappropriate  treatment  that  may  prove 
damaging. 
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In  contrast  to  this,  an  article  by  one  of  the  present  authors  (Bowers, 
1976)  turned  attention  to  assumptions  concerning  the  consultant's  diagnosti 
rol e in  O.D.  The  points  made  there  bear  repeating. 

. While  a number  of  writers  have  attributed  a diagnostic 
role  to  consultants,  what  goes  unrecognized  is  that 
their  diagnoses  are  often  put  to  little  other  than 
heuristic  use  (that  is,  they  are  used  merely  to 
stimulate  an  interesting  discussion). 

. An  unpublished  study  of  consultants'  diagnostic  skills 
showed  (a)  inability  to  agree  with  diagnostic  con- 
clusions more  formally  obtained,  and  (b)  more  positive 
change  occurring  where  consultants  did  relatively  little 
diagnosing  than  where  they  did  a great  deal  of  it. 

. Most  consultants  currently  employ  diagnostic  methods 
which  rely  upon  one  observer — the  consultant  himself 
or  herself--to  obtain  data.  The  N is  restricted,  not 
only  in  this  fashion,  but  also  by  the  fact  that  this 
consultant-observer  is  limited  to  a time-bound  behavior 
sample. 

These  observations  should  not  surprise  us.  Findings  from  the  general 
field  of  assessment  and  classification  have  provided  strong  support  to 
the  position  that  statistical  prediction  is  superior  to  non-statistical 
or  judgmental  methods  (Cronbach,  1960).  For  example,  in  Meehl's  (1954) 
major  review  of  clinical  versus  statistical  prediction,  it  was  found  that 
statistical  prediction  was  equal  to  or  superior  to  clinical  prediction 


in  19  out  of  20  cases. 
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Citing  this  body  of  accumulated  evidence,  Cronbach  explores  the 
reasons  for  perenially  poor  showings  by  (clinical)  judges: 

. Judges  combine  data  by  means  of  intuitive  weightings 
which  they  have  not  checked. 

. Judges  casually  change  weights  from  one  case  to  the 
next. 

. Judges  are  unreliable,  in  the  sense  that  the  same 
case  might  not  be  judged  the  same  way  twice  in 
succession. 

. Judges  have  stereotypes  and  prejudices  which  affect 
their  judgments. 

His  conclusions  are  the  following: 

"What  does  this  imply?  It  implies  that  counselors, 
personnel  managers,  and  clinical  psychologists  should 
use  formal  statistical  procedures  wherever  possible 
to  find  the  best  combining  formula  and  the  true 
expectancies  for  their  own  situation.  They  should 
then  be  extremely  cautious  in  departing  from  the 
recomnendations  arrived  at  on  the  basis  of  the 
statistics..."  (p.  348) 

If  this  is  the  desirable  state  for  organizational  development  as  well, 
it  is  scarcely  what  in  fact  obtains.  Levinson  (1972,  1973),  in  his 
published  remarks  which  led  to  the  celebrated  exchange  with  Burke  (1973) 
and  Sashkin  (1973),  stated  that  there  is  little  resembling  formal  diagnosis 
in  O.D.  Consistent  with  Kahn's  (1974)  observations,  Levinson  stated 
that  the  field  is  characterized  by  "ad  hoc  problem-solving  efforts  and  a 
heavy  emphasis  on  expedient  techniques."  Tichy  (1974)  does  not  reassure 
us  when  he  finds,  in  his  systematic  empirical  study,  that  change  agents 
(consultants)  seem  to  have  limited  diagnostic  perspectives,  that  their 
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diagnostic  frameworks  are  rather  closely  limited  to  thei^  personal  values 
and  goals,  and  that  the  potential  for  intrusion  of  bias  is  not  small. 

Unfortunately,  recommended  alternatives  are  relatively  scarce. 
Levinson's  recommendations  build  upon  a view  and  a method  of  organizational 
diagnosis  that  is  an  extension  of  the  clinical  case  method.  While  large 
amounts  of  empirical  data  would  be  gathered,  injecting  a clinical  judge 
between  the  data  and  the  conclusion  runs  the  risks  listed  above  by 
Cronbach. 

On  the  other  hand,  this  is  not  the  situation  nor  the  age  for  "raw" 
empiricism.  As  the  lengthy  discussions  nationally  about  discrimination 
in  testing  have  revealed,  in  the  interest  of  fairness  and  equal  treatment, 
more  must  be  taken  into  account  in  a decision  process  than  any  simple  set 
of  numbers,  especially  where  connections  between  the  numbers  and  real 
world  events  may  not  be  obvious.  In  a similar  vein,  the  sudden  rise  of  the 
assessment  center  concept  has  shown  that  an  appropriate  criterion  in  this 
day  and  age  (in  employee  selection,  but  by  extension  to  the  problem  of 
treatment  selection  in  0.0.)  must  include  demonstrable  connection  between 
the  measures  used  and  the  operations  or  functions  performed  in  the  real 
organization. 

These  facts  lead  us  to  the  following  preliminary  conclusions,  which 
form  a starting  point  for  the  research  to  be  undertaken  in  the  present 
report: 

. The  base  of  scientific  knowledge  which  undergirds 
organizational  development,  while  it  is  growing 
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rapidly,  is  still  remarkably  small. 
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. Much  of  what  is  done  is  based  upon  consultants' 
predilections  or  fads,  not  upon  sov’d  reasons 
diagnostically  generated. 

. There  is  as  yet  little  that  could  really  be 
termed  rigorous  diagnosis  practiced  within  the 
O.D.  profession. 

. Here,  as  elsewhere,  statistical  prediction  is 
likely  to  prove  far  more  accurate  than  clinical, 
or  clinically  mediated,  prediction. 

. Raw  empiricism,  in  the  form  of  predictors  not 
obviously  related  to  the  processes  and  functions 
being  diagnosed,  no  matter  how  seemingly  accurate, 
are  no  longer  societal ly  acceptable.  Prediction 
must  be  based  upon  measures  derivable  from  solid 
scientific  evidence  about  organizational 
functioning. 

To  understand  what  is  or  must  be  involved  in  diagnosis,  v/e  turn  to 
a field  which  has  practiced  and  taught  diagnosis  for  years  and  decades, 
or  even  centuries:  medicine.  Ledley  and  Lusted  (1959),  in  what  must 
be  counted  as  a seminal  article,  dealt  at  some  length  with  the  reasoning 
foundations  of  medical  diagnosis.  Exhibit  1 presents  a few  of  the  principal 
points  which  they  make,  along  with  organizational  diagnostic  analogs. 

In  the  next  sections  we  present  a brief  discussion  of  the  content  of  each 
point. 


ISSUES  IN  DIAGNOSTIC  REASONING 
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Symptoms  and  Disorders 

The  total  pool  of  available  characteristics  (of  client  units)  is 
at  any  time  limited  to  those  which  our  knowledge  base  contains  some  infor- 
mation about  and  which  our  measurement  methods  are  capable  of  measuring. 
All  available  characteristics  are,  at  some  level  on  their  respective 
scales,  potential  symptoms.  Whether  they  are,  in  fact,  regarded  as 
"symptoms"  or  not  depends  upon  what  past  research  and  experience  has  found 
to  be  true  --  that  is,  what  has  been  added  to  the  knowledge  base. 

What,  then,  are  diseases,  disorders,  or  states  of  organizational 
dysfunction?  A disease  is  a hypothetical  construct  — a theoretical 
term  used  for  convenience  purposes  to  refer  to  a whole  chain  of  physical 
events  which  are  hypothesized  as  having  occurred.  "Proof"  that  the 
hypothesized  sequence  has  occurred  (or  is  occurring)  is  obtained  by  some 
form  of  validation  process.  This  validation  can  be  concurrent  or  even 

retrospective:  if  little  Johnny  has  influenza,  he  should  display 
particular  additional  characteristics  or  should  have  displayed  them  within 
the  last  24  to  48  hours.  It  can  also  take  the  form  of  construct 
validation,  that  is,  of  showing  that  only  those  observables  that  are 
hypothesized  as  going  together  in  fact  appear.  Finally,  the  validation 
process  can  be  predictive:  we  can  wait  to  see  whether  subsequent, 
predicted  signs  of  influenza  appear  in  little  Johnny's  case.  Througtiout 
this  sequence  of  comparisons,  however,  "influenza"  is  a hypothetical 
sequence  of  events  which  we  presume  to  be  able  to  see  specific  signs 
of  at  specific  points  in  time.  Its  excellence  as  a classification 
category  at  any  given  point  in  the  profession's  development  is  entirely 
dependent  upon  the  quality  and  completeness  of  the  knowledge  base  from 
which  we  work,  as  it  relates  to  the  distinctions  between  this  category 
and  others. 
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What,  then,  determines  what  a disease  is?  It  is  the  generalization 
and  codification  processes  which  past  knowledge  generators  have  gone 
through  in  integrating  the  findings  from  research  and  experience. 

> Diagnostic  procedures  which  rely  upon  "expert"  assignment  to  diagnostic 

categories  simply  substitute  the  expert  clinician  for  more  public  and 
repli eatable  listings.  If  the  experts'  procedures  are  unreliable,  their 
classification  is,  as  a criterion,  worthless.  If  they  are  reliable  and 
valid,  it  is  a valuable  aid  — a shortcut  to  employing  the  knowledge 
base  directly  and  in  its  entirety. 

Regardless  of  the  way  in  which  we  mediate  the  process  by  which  the 
knowledge  base's  contents  get  represented,  the  disease,  disorder,  or 
dysfunction  is  nothing  other  than  a string  of  symptoms  very  much  like 
those  which  we  look  at  in  any  particular  case.  It  is  to  this  hypothetical 
symptom  string  or  pattern  that  comparison  is  made  in  a diagnosis. 


Diagnosis  as  Probability  Statements 

In  organizational  development  and  change,  the  diagnostic  process 
follows  essentially  this  same  pattern.  Symptoms  are  organizational 
characteristics  which  past  research  indicates  go  together  to  define  some 
more  general  statement  of  organizational  health  or  dysfunction.  That 
our  "diseases"  do  not  have  exotic  names  in  Latin  should  not  dismay  us. 
Perhaps  the  absence  of  names  at  all  is  an  advantage,  in  fact.  Certainly 
there  have  been  fewer  years  and  resources  available  as  yet  for  the 
codification  of  the  knowledge  base,  and  our  professional  schools  teach 
us  to  be  hesitant,  cautious,  and  qualifying  in  our  statements,  rather 
than  authoritative,  definitive,  and  final.  These  are  issues  of  style, 
however,  rather  than  substance.  The  fact  remains  that  there  is  an 
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existing  knowledge  base,  comparison  to  which  permits  us  to  make  a 
probability  statement  concerning  any  case  at  hand.  Here,  as  in  medicine, 
a diagnostic  statement  is  a "best  guess." 

Relevance  to  Treatment  Selection 

The  whole  purpose  of  a diagnosis  is  to  permit  the  selection  of  an 
optimum  treatment  or  intervention.  Here,  as  in  the  case  of  medicine, 
such  choices  are  subject  to  social,  ethical,  economic,  and  moral 
constraints  imposed  by  the  society  in  which  we  live.  Certain  interventions 
may  be  socially  unacceptable  or  even  morally  offensive.  For  example, 
intensely  confrontational  techniques  are  clearly  unacceptable  in  many 

more  traditional  organizational  settings,  and  under  certain  circum- 
stances it  is  conceivable  that  top  management  team  development  training 
might  generate  an  in-group  clubbiness  whose  effects  are  racially 
discriminatory  ard  therefore  morally  offensive.  Other  interventions, 
no  matter  how  appropriate  and  promising,  might  be  so  expensive  as  to  be 
prohibitive,  while  still  others  that  would  solve  the  problem  might 
lead  to  violations  of  privacy  and  confidentiality  which  must  be  judged 
to  be  unethical. 

However,  within  the  limits  which  these  constraints  impose,  the 

problem  becomes  one  of  selecting  an  optimum  treatment  from  a pool  of 

\ 

those  available.  What  is  optimum?  Ledley  and  Lusted  (1959)  turn  to 
value  (decision)  theory  in  an  attempt  to  answer  this  question.  Bowers 
and  Hausser  (1977)  have  shown  ho w the  organizational  development  problem 
can  itself  be  cast  into  these  same  terms,  and  have  presented  empirical 
evidence  about  a limited  number  of  intervention  strategies. 
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A diagnostic  procedure  which  clearly  differentiates  cases  to  which 
each  of  the  known  and  available  interventions  are  appropriate  would 
obviously  be  superior  to  one  which,  in  some  measure  or  other,  was  unable 
to  distinguish  a condition  calling  for  one  intervention  from  a condition 
calling  for  another.  At  the  most  undesirable  extreme  would  be  a 
"diagnostic"  procedure  whose  conclusions  lead  always  to  the  same 
treatment  or  intervention,  a condition  which  Levinson  (1972)  implied 
occurs  in  organizational  development  all  too  frequently. 

Role  of  the  Knowledge  Base 

Even  with  a relatively  simple  rating  system  of  "Yes-No"  or 
"Present-Absent,"  a list  of  N possible  characteristics  produces  2W 
potential  combinations.  The  number  of  potential  "diseases"  or  dysfunc- 
tional states  --  represented  by  the  number  of  cells  in  an  N-dimensional 
lattice  — is  obviously  unrealistically  large.  In  any  comprehensive 
scheme,  all  of  the  available  units  in  the  world  probably  would  be 
insufficient  to  providing  a single  case  per  cell.  The  equally  obvious 
conclusion  is  that  most  cells  are  empty,  that  they  represent  nonexistent 
disorders,  and  that  only  a relative  few  comprise  the  set  of  "real" 
possibilities.  It  is  the  task  of  the  knowledge  base  to  provide  us  with 
current,  accurate  information  about  what  these  possibilities  are. 

Much  the  same  point  is  made  in  the  theory  of  adaptation  in 
natural  and  artificial  systems  (Holland,  1975).  Combinations  of 
characteristics  rapidly  generate  astronomical  numbers  of  possibly 
adaptive  structures.  If  the  organism  or  system  were  to  choose  an 
enumerative  adaptive  plan  --  simply  running  down  the  list  randomly 
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until  it  found  the  one  that  worked  --  adaptation  would  rapidly  become 
impossible.  As  the  writer  just  cited  indicates,  given  even  the  fastest 
computers  in  existence,  it  would  require  "a  time  vastly  exceeding  the  age  of 
the  universe' to  test  lO100  structures."  Instead,  adaptive  plans  to  be 
feasible  must  be  robust  — that  is,  they  must  be  efficient  over  the  range 
of  situations  which  will  be  encountered.  One  general  requirement, 
therefore,  is  that  an  adaptive  plan  must  retain  advances  already  made, 
as  well  as  parts  of  the  history  of  what  has  occurred.  This  information, 
of  course,  is  what  constitutes  the  knowledge  base  in  any  diagnostic  system. 


Improvement  Probabilities  and  the  Single  Case 

At  first  blush,  the  statement  seems  unfeeling  or  insensitive  that 
we  maximize  the  probability  of  any  individual  unit's  shaving  improvement 
when  we  apply  to  it  a strategy  shown  to  maximize  the  number  of  uni ts 
showing  improvement.  Organizational  development  is,  after  all,  a human 
practice  profession,  and  it  seems  impgssible  to  ignore  facts  obviously  at 
hand  (within  range  of  our  personal  observations,  for  example). 

Nevertheless,  observations  based  upon  an  N of  one  (observer), 
collected  under  atypical  conditions  and  within  nonrepresentati vely  short 
time  frames,  are  no  more  reliable  and  accurate  when  taken  singly  than 
they  would  be  if  used  en  masse  for  large  numbers  of  cases. 

This  issue  was  touched  upon  by  one  of  the  present  writers  in  an 
earlier  report:  "Even  the  most  accurate  diagnosis  may  suffer  from  mid- 
stream or  horseback  revisions  made  oy  the  consultant  as  he  approaches  its 
use.  Basically,  any  data  collection  and  analysis  method  treats  with  some 
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degree  of  care  and  accuracy  a portion,  but  not  all,  of  the  behaviors, 
events,  and  issues  in  the  life  space  of  the  client  system.  Some  portion 
is  unique  to  that  system,  or  to  any  group  within  it,  or  will  have  been 
excluded  from  the  array  of  information  categories  designed  in  the  diagnostic 
process  at  its  inception.  As  the  consultant  approaches  a particular  unit 
or  group  of  the  client  system,  he  will  necessarily  see  other  aspects  of 
what  he  feels  are  its  functioning  not  represented  in  the  diagnosis  which 
he  has  in  hand.  Since  he  is  dealing  with  a real  client,  in  a real  world 
situation,  the  temptation  is  well  nigh  irresi stable  to  revise  the  diagnosis 
on  the  basis  of  his  current  observation.  Yet,  he  is  one  observer  observing 
at  best  a limited  and  time-bound  behavior  sample.  To  the  extent  that 
he  makes  such  revisions  he  therefore  very  likely  reduces  both  the 
reliability  and  the  validity  of  the  diagnosis  with  which  he  works.  Said 
otherwise,  he  approaches  each  group,  or  each  setting,  as  a unique  instance 
with  live  people  and  real  problems.  Yet  in  many  ways  the  diagnosis  and 
treatment  problem  in  organizational  development  is  a "large  N"  problem. 

Were  he  to  work  on  the  basis  of  the  diagnostic  data  provided  to  him  and 
that  alone,  given  that  it  is  reliable  and  valid,  he  would,  across  a large 
nuntjer  of  cases,  succeed  in  a high  portion  (assuming  that  the  diagnostic 
and  prescription  processes  are  themselves  high  in  quality,  reliable,  and 
valid).  Yet  he  does  not  ordinarily  approach  his  role  with  that  degree 
of  objective  detachment,  and  each  time  that  he  yields  to  the  temptation 
to  revise  on  the  basis  of  "current  reality"  he  submits  himself  to  a situation 
in  which  his  action  steps  are  based  on  less  than  acceptably  reliable  and 
valid  data."  (Bowers,  1974) 
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Toward  Relevant  Research 

Clearly,  therefore,  any  attempt  to  develop  and  test  more  rigorous 
diagnostic  procedures  in  organizational  development  should  be  based  upon 
a model  containing  principles  of  organizational  functioning.  In  other 
words,  it  should  be  theoretically  anchored  to  a conceptual  statement  that 
is  itself  both  organizational  in  content  and  comprehensive  in  scope. 

While  the  literature  on  organizational  management  is  ripe  with  theoretical 
statements,  most  of  them  do  not  meet  criteria  of  acceptability  for  our 
present  purposes.  Many  must  be  dismissed  as  less  comprehensive  than  is 
necessary  for  the  present  problem:  that  is,  they  are  elegant  treatments 
of  an  isolated  issue  such  as  job  design  or  individual  satisfaction  or 
leadership,  but  tney  ignore  other  areas.  Others  may  be  rejected  because, 
although  they  encompass  most  of  the  domain,  they  are  lacking  in  adequate 
empirical  underpinnings.  However,  cne  theoretical  statement  which  does 
appear  to  meet  the  criteria  just  outlined  is  the  Likert  meta-theoretical 
paradigm  (Likert,  1961,  1967,  1976;  Bowers,  1976).  It  is  this  theoretical 
statement  which  underlies  the  measures  collected  in  the  data  bank  to  be 
used  in  the  research  represented  in  this  project. 

Most  recent  evidence  suggests  that  this  paradigm  assumes  the  form 
taken  in  Figure  1 (Bowers  & Franklin,  1977).  As  a set  of  principles,  this 
paradigm  would  appear  to  satisfy  the  criteria  of  comprehensi ve  and 
evidential  validity  (Bowers  & Franklin,  1977;  Likert,  1977).  It  is 
operationalized  here  in  the  form  of  the  Survey  of  Organizations,  a machine- 
scored  standardized  questionnaire  which  has  been  used  in  various  editions 
since  1966  to  collect  organizational  survey  data  for  assessment,  feedback, 
and  benchmark  purposes  (Taylor  & bowers,  1972).  Portions  of  these 


Figure  1 

ORGANIZATIONAL  FUNCTIONING 
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banked  data  have  been  used  in  earlier  research  efforts  related  to 
organizational  development.  In  this  regard,  a method  of  diagnostic 
classification  was  previously  developed  and  preliminarily  tested. 

Termed  CANOPUS,  it  contains  a software  package  designed  to  generate  a 
diagnostic  statement  for  groups  and  pyramids  of  groups  comprising 
organizations  (Bowers,  1974).  This  classification  method  is  based  upon 
a typology  of  work  groups  developed  in  the  course  of  prior  research. 

The  technique  used  for  the  development  of  the  typology  was  profile 
analysis,  in  which  one  arrives  at  a clustering  of  work  groups.  The 
profile  consisted  of  a group's  scores  on  the  S00  indexes  and  as  a 
profile  reflected  three  basic  kinds  of  information:  level,  dispersion, 
and  shape.  Level  was  the  mean  score  of  the  work  group  over  the  indexes 
in  the  profile;  dispersion  reflected  how  widely  scores  in  the  profile 
diverged  from  the  average;  and  shape  concerned  the  profile's  high  and 
low  points. 

A measure  of  profile  similarity  that  takes  shape,  level  and  dis- 
persion into  account  is  the  distance  measure.  If  one  considers  a person 
(or  group)  as  a point  in  a multidimensional  space  in  which  each  dimension 
represents  a variable  or  index,  then  the  distance  betv/een  two  points, 
that  is,  persons  (or  groups),  can  be  computed  using  the  generalized 
Pythagorean  theorem.  The  distances  can  then  be  examined  to  determine 
which  groups  cluster  together  in  that  multidimensional  space. 

The  clustering  technique,  called  hierarchical  grouping,  uses  this 
distance  measure  as  a measure  of  profile  similarity.  Computer  software 
is  available  for  this  technique  in  the  program,  HGROUP  (Veldman,  1967). 
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This  program  begins  by  considering  each  original  object,  in  this  case 
a work  group,  of  those  to  be  clustered,  as  a cluster.  These  N clusters 
are  then  reduced  in  number  by  a series  of  step  decisions  until  all  N 
objects  have  been  classified  into  one  or  the  other  of  two  clusters. 

At  each  step  the  number  of  clusters  is  reduced  by  one  through  combining 
some  pair  of  clusters.  The  particular  pair  to  be  combined  at  any  step 
is  determined  by  the  computer's  examining  all  the  available  combinations 
and  choosing  the  one  which  minimally  increases  the  total  variance  within 
clusters.  It  is  this  latter  minimizing  function  that  utilizes  the  dis- 
tance notion.  The  total  variance  within  clusters  is  a measure  of  the 
closeness  of  the  points  in  multivariate  space  in  clusters  already 
chosen.  A substantial  increase  in  this  variance,  which  the  HGROUP 
program  labels  an  error  term,  indicates  that  the  previous  number  of  clusters 
is  probably  optimal  for  the  original  set  of  objects  or  work  groups. 

The  program  provides  an  identification  of  those  groups  contained  in  each 
cluster  so  that  further  analyses  can  be  conducted  on  phenomena  within 
clusters. 

The  HGROUP  program  was  applied  to  three  random  subsamples  drawn  - , 
from  the  data  bank  (Hausser  & Bowers,  1977  ).  When  the  three  sets  of 
data  were  considered  jointly,  a total  of  17  distinct  profiles  emerged. 

In  many  ways,  this  software  system  would  appear  to  meet  generally  the 
requirements  listed: 

. It  compares  data  to  appropriate  norms. 

. Problems  once  identified  are  prioritized  in  terms  of 
their  potential  impact  upon  outcomes. 

. It  seeks  causes  for  observed  conditions  among  situa- 
tional, information,  skill  and  values  conflict 
predictors,  employing  a distance  statistic. 
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. It  selects  a broad  set  of  potentially  appropriate 
action  steps  from  an  array  of  possibilities. 

It  converts  the  whole  and  its  parts  into  a readable 
narrative  by  computerized  text-writing. 

Still,  the  outcome  of  the  method  is  based  upon  the  measures,  and  those 
measures  derive  from  the  theoretical  paradigm  previously  cited.  While 
attractive,  it  is  but  one  of  several  statements  that  might  have  formed 
the  basis  for  operations  and  measures.  Clearly  some  difference  among 
theorists  is  to  be  expected.  The  domain  is  sliced  differently,  and  the 
terms  applied  to  collections  or  clusters  of  behaviors  and  processes  will 
vary  substantially.  However,  if  the  fundamental,  general  algorithm  is  the 
same,  we  can  at  least  be  somewnat  reassured  that  subsequent  work  will 
not  be  unacceptably  parochial. 

In  an  effort  to  address  this  question,  in  an  earlier  report  the 
writings  of  nearly  30  prominent  persons  In  the  organizational  management 
field  were  examined.  (Bowers,  Davenport  & Wheeler,  1977)  The  conclusion 
from  this  examination  of  the  field  was  that  we  can  be  reasonably  confident 
that  a common  algorithm  underlies  most  of  the  major  works  in  it.  It 
leaves  us  reassured  that  adherence  to  an  alternative  formulation  from  this 
list,  if  pursued  to  its  most  basic  form,  would  not  result  in  an  utterly 
different  diagnostic  scheme.  Ten, is  might  be  different,  and  the  operations 
employed  by  each  writer  to  measure  particular  sets  of  variables  might  vary 
widely,  but  the  rationale  and  the  set  of  primitive  constructs  would  be 
very  much  the  same. 
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A Stock-Taking  and  Some  Implications 

Aqainst  the  expressed  need  for  improved  methods  for  diaqnosis  in 
organizational  development  we  can  array  the  following  major  points  from 
the  preceding  discussion.  An  adequate  diagnostic  procedure  necessitates: 

(1)  A theoretical  model  which  is  acceptably  comprehensive, 
which  shares  the  same  general  algorithm  present  in  the 
array  of  principal  alternative  formulations. 

(2)  A bank  of  data,  collected  by  a standardized  instrument 
in  a wide  variety  of  organizational  settings,  both 

mi li tary  and  ci vi li  an. 

(3)  A recognition  that  accuracy  in  diagnosis  will,  here  as 
elsewhere,  very  likely  be  enhanced  by  statistical  opera- 
tions rather  than  clinical  judgment. 

(4)  An  acknowledgment  of  societal  requirements  rejecting  "raw" 
empiricism  in  favor  of  statistical  procedures  and  measures 
which  are  content  valid. 

Considering  the  magnitude  of  the  problem,  the  size  of  data  sets, 
and  the  turnaround  time  requirements  present  in  most  organization  develop- 
ment situations,  yet  another  requirement  would  appear  to  be  present: 
that  whatever  operations  result  be  computer-assisted.  In  this  area,  one 
can  profit  from  the  experience  of  another  practice-oriented  profession 
whose  researchers  have  explored  computerized  diagnostic  procedures, 
psychiatry. 
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There,  as  elsewhere,  substantial  difference  of  opinion  exists 
concerning  the  best  method  of  evaluating  the  importance  of  symptoms. 

Two  general  types  of  models  relying  upon  probability  statistics  have 
been  proposed: 

A.  A discriminant  function  model,  in  which  each  symptom  is 

given  an  empirically  derived  weight,  and  an  artificial  measure 
is  then  obtained  as  the  sum  of  the  weighted  values 
(Crooks,  Murray  A Wayne,  1959). 

The  arguments  in  favor  of  a muHiple  discriminant  model  are  at  least 
threefold : 

(1)  it  better  renlicates  the  thought  process  employed  by  the 
human  diagnostician,  who  does  not  treat  each  symptom  in 
present/absent  fashion,  but  rather  attaches  greater  or 
less  weight  to  each  symptom  according  to  past  experience 
(i.e.,  his/her  own  version  of  the  knowledge  base.) 

(2)  Symptoms  which  correlate  highly  with  the  presence/absence 
of  a disease  are  given  more  weight  than  those  that  have 
showr.  little  or  no  correlation  to  its  presence/absence. 

(3)  Appropriate  weighting  also  depends  upon  a symptom's 
correlation  with  other  symptoms.  If  the  overlap  is  high, 
then  one  would  weight  the  second  symptom  much  lower  than 
would  be  the  case  if  the  two  symptoms  have  little  relation- 
ship to  each  other. 
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At  least  two  objections  have  been  raised  to  this  method: 

(1)  It  relies  upon  the  accumulation  of  a large  developmental 
sample  of  cases,  which  is  difficult,  expensive,  and 

in  most  instances  unlikely. 

(2)  It  capitalizes  upon  accidental  features  of  the  develop- 
mental sample  and  thus  gives  an  inflated  estimate  of 
its  accuracy.  If  the  validation  sample  comes  from  a 
somewhat  different  population,  the  drop  in  efficacy 

is  even  greater. 

B.  A Bayesian  or  frequency-count  model,  in  which  the  relative 
frequency  of  occurrence  of  each  possible  symptom-disease 
pattern  is  considered  (Ledley  & Lusted,  1959). 

The  principal  argument  raised  in  favor  of  a Bayesian  approach  to 
computerized  diagnosis  appears  to  be  that  it  also  is  claimed  to  model  the 
human  judgment  process  by  which  symptoms  are  converted  into  a diagnostic 
statement  (that  is,  that  the  physician,  for  example,  employs  a conditional 
probability  judgment  process  in  arriving  at  a diagnosis.) 

The  objections  are  a bit  more  extensive: 

(1)  It  is  difficult,  if  not  impossible,  in  diagnostic  work  to 
satisfy  the  conditional  independence  requirement  (the  require- 
ment that  the  probability  of  finding  one  particular  symptom 
given  that  the  disease  is  present,  is  unaffected  by  the 
presence  or  absence  of  any  other  symptom.) 

(2)  As  in  the  other  statistical  method,  it  requires  the 
accumulation  of  a large  developmental  sample  of  appropriate 


form  and  content. 
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(3)  The  necessary  assumption  that  the  diseases  are  muUally 
exclusive  may  not  hold. 

(4)  As  in  the  other  statistical  method,  it  capitalizes  upon 
accidental  features  of  the  developmental  sample. 

To  these  have  been  added  a third  method  which: 

C.  Treats  the  issue  as  a decision-tree  problem,  thus  relying 
maximally  upon  excellence  of  the  knowledge  base  and  not  at 
all  upon  probabilities  in  a developmental  sample  (Spitzer 
& Endicott,  1968). 

The  arguments  in  favor  of  this  method  are  given  by  at  least  one 
proponent  (Spitzer,  et  al , 1974)  as  the  following: 

(1)  It  is  independent  of  any  specific  body  of  data;  that  is,  it 
does  not  require  a large  developmental  sample. 

(2)  It  is  not  constructed  so  as  to  be  optimal  for  any  one 
population  and  for  this  reason  "travels  well"  from  one  setting 
to  another. 

(3)  As  in  the  case  of  each  of  the  other  two  methods,  it  is  thought 
to  represent  optimally  the  thought  processes  of  the  human 
diagnostician. 

The  objections  are  the  following: 

(1)  It  is  quite  dependent  upon  the  accuracy  of  tne  theory  which 
underlies  trie  decision  tree  and  is  therefore  ultimately  as 
dependent  as  the  other  methods  upon  past  data  accumulations, 
their  care  and  form. 

(2)  Its  general i zabi li ty  may  be  more  apparent  than  real. 

(3)  Its  assumption  that  the  diseases  are  mutually  exclusive 
may  not  hold. 

Vi 
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Comparison  of  the  Three  Methods 

Several  efforts  have  been  undertaken  in  psychiatry  to  compare  two  or 
more  of  these  methods  empirically.  The  results  are  best  described  as 
decidedly  unclear.  Overall  and  Hollister  (1964)  conducted  one  such  com- 
parison, but,  unfortunately,  the  rules  used  by  their  computer  programs 
were  obtained  from  diagnostic  stereotypes  provided  by  experts,  rather  than 
from  observed  characteristics  of  actual  cases. 

Melrose,  et  al.  (1970)  compared  a multiple  discriminant  with  a 
decision- tree  approach  and  found  that:  (a)  on  single  assignments  the 
decision- tree  approach  showed  a greater  degree  of  agreement  with  an  expert 
judgment  criterion;  (b)  if  first,  second,  or  third  possible  assignments 
were  allowed,  the  multiple  discriminant  method  showed  a greater  degree  of 
agreement  than  did  the  decision-tree;  and  (c)  in  any  event,  each  method 
performed  better  for  certain  diagnostic  categories. 

Finally,  Fleiss,  et  al.  (1972)  compared  all  three  methods  snd  found 
none  of  the  three  to  be  clearly  superior  to  the  other  two.  Again,  however, 
the  criterion  was  agreement  with  expert  diagnoses,  a criterion  whose 
unreliability  the  authors  duly  note. 


A Dilemma  and  Some  Issues 

The  work  from  psychiatry,  just  cited,  contains  a dilemma  whose  existence 
questions  the  whole  body  of  findings  and  whose  resolution  might  be  seen 
as  rendering  the  whole  exercise  rather  trivial.  Elegant,  repli eatable , 
and  readily  transportable  methods  are  designed  and  tested  against  a 
criterion  of  "judgment  by  expert  clinicians."  Yet  here,  as  elsewhere,  the 
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Cronbach  warnings  apply:  expert  clinical  judgment  is  notoriously  unreliable 
What  has  been  developed,  therefore,  are  three  elegant  ways  of  replicating 
an  unreliable  procedure.  On  the  other  hand,  had  a reliable,  repl icatable, 
transportable  procedure  existed  for  use  as  a criterion,  it  would  no  doubt 
have  been  more  sensible  to  employ  it  as  Uie  diagnostic  method,  rather  than 
as  a criterion  for  other  methods . 

Several  implications  stem  from  this  observation.  First,  where  in 
psychiatry  expert  clinical  judgment  is  an  unreliable  classification  method 
and  criterion,  in  the  present  instance  we  do  possess  a reliable,  verifiable 
procedure,  one  based  upon  the  distance  statistic.  An  issue  of  some 
importance  lies  in  these  facts,  that  a distance  function  was  used  to 
generate  the  original  typology  and  that  at  least  the  discriminant  function 
bears  a close  kinship  to  that  method  --  closer  than  does  the  decision 
tree  procedure,  for  example.  This  breaxs  down  into  two  component  questions: 
(a)  to  what  extent  is  the  typology  an  artifact  of  the  method,  and  (b)  to 
what  extent  are  our  findings  simply  evidence  of  the  fact  that  nothing 
can  approximate  a distance  function  like  a distance  function? 

To  address  this,  let  us  suppose  that  the  total  population  of  approxi- 
mately 7,000  groups  were  uniformly  distributed  in  the  14  space  represented 
by  the  measures.  To  generate  the  typology,  we  originally  selected  three 
samples  totalling  533  groups.  We  found  that  the  clustering  procedure 
classified  these  into  17  clusters.  Was  that  clustering  a chance  outcome, 
a "forcing"  into  clusters?  The  original  results  suggest  that  the  probability 
of  that  was  extremely  low.  Still,  let  us  suppose  that  it  was  that  once 
in  50  or  100  times  when  such  a result  would  occur  by  chance.  What  would 
then  be  the  result  of  attempting  to  fit  the  entire  population  to  the  types 
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so  generated?  Obviouslyv  we  would  not  expect  a fit  to  occur  --  by  a 
distance  function  or  any  other  function. 

Alternatively,  one  may  approach  the  problem  from  a conceptual, 
rather  than  an  empirical,  viewpoint.  Let  us  assume  for  the  moment  that 
the  typology  is  real  --  that  these  really  do  exist  17  types  which  have 
true  values  on  each  of  the  14  dimensions.  In  the  original  study  we 
identified  that  they  do  exist  and  then  estimated  their  values  by  calculating 
the  mean  index  scores  for  the  defining  clusters.  Our  estimates  may  be 
in  some  measure  discrepant  from  those  true  values.  Regardless  of  the  fact 
that  a distance  function  was  used  to  define  the  clusters,  the  profile 
estimates  differ  from  the  true  values  on  one  or  more  of  the  three  con- 
stituent elements  of  distance:  level,  dispersion,  and  shape.  In  other 
words,  our  estimate  of  the  type’s  profile  value  may  differ  from  the  true 
value  because  it  is  too  high  or  too  low  (level),  because  the  component 
indexes  are  too  compact  or  too  spread  out  (dispersion),  or  because  the  rank 
order  of  indexes  in  the  profile  is  out  of  order  (shape.) 

Pulling  these  alternative  lines  of  thought  together,  if  a typology 
does  truly  exist,  and  we  have  closely  estimated  the  profile  values 
involved,  it  matters  little  that  we  did  so  by  a method  that  clusters 
according  to  a distance  function.  The  profiles  are  for  all  intents  and 
purposes  those  of  the  true  types.  We  can  then  judge  the  accuracy  of  the 
distance  function  --  or  the  discriminant  function  or  the  decision  tree 
method  — by  the  tightness  with  which  that  method  is  able  to  align  the 
population  to  the  "true"  types,  in  part,  therefore,  we  do  two  things 
simultaneously  when  we  classify  all  cases  in  the  population  by  a distance 
function: 


(a)  we  assess  the  plausibility  of  the  original  typology 
itself,  and 

(b)  if  the  typology  proves  to  be  plausible,  we  assess  the 
degree  (compared  to  other  methods)  to  which  a 
distance  function  itself  is  effective  in  classifying 
cases. 

However,  above  and  beyond  this  issue,  there  are  other  issues  of 
equal  or  greater  importance,  the  answers  to  which  are  in  no  way  obvious. 
These  issues  of  evaluating  classification  procedures  include: 

. Accuracy  Based  Criteria 

. Proportion  of  correct  classification 
. Typology  reproduction 
. Weighted  dimensional  distances 
. Zero-one  counts 
. Severity  of  misclassification 
. Non-Accuracy  Based  Criteria 

. Information  required  to  make  a decision 
..  Amount  of  data  required  to  develop  the  diagnostic 
process 

. Ease  of  calculation 

Of  these,  one  — weighted  dimensional  distances  — seemed,  and  in  fact 
proved,  to  be  beyond  the  scope  and  resources  of  the  present  project.  The 
remaining  seven  issues  were  Investigated.  A brief  description  of  them  is 
presented  in  the  following  sections. 
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Accuracy  Based  Criteria 

The  purpose  of  this  section  is  to  present  four  different  criteria 
to  use  in  evaluating  classification  procedures,  each  of  which  is  based  on 
a measure  of  accuracy.  In  later  sections,  non-accuracy  based  criteria 
are  discussed. 

The  four  different  classification  procedures  to  be  investigated  are 
a decision  tree  (DT),  multiple  discriminant  function  (MDF),  a Bayes  rule  (B), 
and  a straight  (least  squares)  distance  function  (DF).  The  typology  into 
which  work  groups  are  to  be  assigned  has  been  discussed  in  earlier  reports 
(Bowers,  Davenport  & Wheeler,  1577).  Two  aspects  of  the  typology  and  its 
development  are  relevant  to  the  investigation  here.  First,  it  has  been 
derived  by  a clustering  algorithm  which,  in  effect,  grcups  the  observations 
(vectors  of  mean  scores  from  work  groups)  so  as  to  minimize  the  variance 
within  clusters.  The  variance  metric  is  also  a distance  metric,  which  will 
result  in  the  same  classification  by  the  DF  for  a particular  work  group 
as  you  would  get  by  including  that  work  group  in  the  clustering  process 
originally.  The  consequence  is  that  we  will  use  the  distance  function  (DF) 
classification  as  the  correct  classification  for  any  particular  work  g*-oup. 

The  typology  developed  by  the  clustering  algorithm  allows  the  creation 
of  a vector  of  scores  from  the  averages  across  all  work  groups  within 
the  cluster  (Bowers  & Hausser,  1975).  Thus,  we  can  think  of  the  typology 
containing  17  types  as  a set  of  17  vectors.  That  is,  a type  is  represented 
by  a single  vector  of  scores. 
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Proportion  of  Correct  Classification 

The  classical  criterion  for  evaluating  classification  schemes  is  the 
proportion  of  agreement  with  some  external  "expert"  opinion.  In  the 
present  situation,  as  discussed  above,  the  expert  opinion  will  be  that 
provided  by  the  distance  function  (DF).  However,  there  is  some  interest 
in  seeing  how  well  one  data  analytic  technique  is  able  to  reproduce  another 
data  analytic  technique  when  additional  criteria  (beyond  proportion  of 
agreement)  are  considered.  The  reader  is  referred  to  the  following 
sections  for  discussion  of  other  criteria. 

As  a beginning,  we  propose  to  analyze  the  three  classification 
procedures,  MDF,  B,  and  DT , by  comparing  the  proportions  of  agreement  with 
DF.  The  straightforward  process  will  be  as  follows: 

For  each  of  MDF,  B,  and  DT,  compute  the  proportion  of  the 
(approximately)  7,000  work  groups  which  are  assigned  to  the  same 
type  by  both  the  given  procedure  and  DF. 

The  figures  obtained  from  the  above  analysis  can  be  viewed,  for 
each  procedure,  as  the  percentage  correct  within  each  type  of  the  typology 
as  well  as  aggregated  across  all  types.  Such  a review  may  enhance  the 
application  of  the  technique  in  other  settings. 

Keproduoc  the  Typology 

Assume  a particular  classification  procedure  has  classified,  say,  k 
groups  into  a single  type  within  the  typology.  It  then  would  be  possible 
to  compute  the  averages  for  those  k groups,  resulting  in  a single  vector 
of  index  scores.  It  is  desirable  to  have  this  vector  of  scores  for  the 
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assigned  groups  be  close  to  the  vector  of  scores  which  represent  this 
type.  A measure  of  accuracy  of  prediction  would  be  to  compute  the  sum  of 
the  distances  between  each  of  the  17  types  and  their  corresponding 
mean  vectors  from  the  groups  assigned  to  them. 

A criterion  for  evaluating  the  different  classification  schemes,  then, 
would  compare  the  sums  of  distances  between  the  typology  vectors  and  the 
group  means.  Because  of  the  use  of  a standard  distance  measure  to  compare 
the  type  vector  and  the  vector  of  average  scores  from  the  bbservations 
assigned  to  that  type,  the  distance  function  will  do  best  here,  as  well. 
However,  it  would  be  appropriate  to  compute  the  sum  of  the  distances 
for  DF  to  use  as  a standard  against  others  which  could  be  compared. 

The  procedure  to  implement  this  process,  using  all  work  groups, 
is  as  follows: 

For  each  of  MDF,  B,  DT,  and  OF: 

1.  Compute  a vector  of  average  values  for  each  set  assigned  to 
a type  by  the  procedure. 

2.  Compute  the  distance  between  this  vector  of  averages  and  the 
type  vector. 

3.  Sum  the  distances  across  all  types. 

For  each  of  MDF,  B,  and  DT : 

4.  Compute  the  ratio  of  the  sum  to  the  sum  for  DF. 
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Zero-One  Count 

In  assigning  a work  group  to  a single  type  within  the  typology, 
two  different  patterns  of  work  group  scores  may  lead  to  the  same  distance 
from  the  type.  Consider  the  following  example,  simplified  to  reflect 
only  four  dimensions. 

type  (2,2 ,2,1) 

work  group  1 (4, 4, 4, 3)  distance  = 4 

work  group  2 (2, 2, 2, 5)  distance  = 4 

One  approach  to  overcoming  the  above  situation  is  to  use  the  weighted 

dimensional  distances  described  earlier.  An  alternative  approach  is  to 
count  the  number  of  dimensions  for  which  the  assigned  group  is  significantly 
different  from  the  type.  If  we  define  significantly  different  as  being 
greater  than  or  equal  to  two  in  the  above  example,  the  counts  would  be 
four  for  work  group  1 and  one  for  the  second  work  group.  Mathemati cally , 
this  counting  can  be  represented  as  the  sum  of  the  values  c^ , 

fl,  if  |Y.-X.|>s 

c.  =<  and  C = Zc. 

19,  if  |vi-x1|<s.  1 

Here  s is  the  value  defining  significantly  different,  and  and  are  the 
vectors  representing  the  points  of  interest.  Then  C may  be  computed  for  each 
work  qroup  assigned  to  a particular  type,  and  an  average  computed  for  each 
type.  The  choice  of  s should  be  based  on  some  measure  of  relative  magnitude 
or  theoretical  argument.  For  the  present  study,  we  propose  s be  chosen  as 
a measure  of  the  average  variability  of  the  S00  indexes. 

Thus,  the  average  of  C could  serve  as  a criterion  for  evaluating  the 
accuracy  of  classification.  The  procedure  Involved  would  be: 


(1)  Assign  a value  to  s,  the  significant  difference. 

(2)  For  each  procedure,  compute  C for  each  work  group. 

(3)  Compute  CT  for  each  procedure. 
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Severity  of  Mis  classification 

One  argument  against  the  use  of  the  classical  frequency  of  correct 
classifications  is  that  it  treats  all  misclassi fi cati ons  as  equal.  That 
is,  there  is  no  distinction  between  miscl assi fying  a particular  work  group 
into  any  of  the  k-1  incorrect  types  in  a typology  of  k types.  It  is  not 
unusual,  however,  for  the  cost  of  misclassi fi cati on  to  be  widely  different 
across  the  k-1  incorrect  types,  as  well  as  being  contingent  on  the  correct 
type.  For  example,  incorrectly  diagnosing  an  individual  with  a severely 
sprained  ankle  as  having  a broken  ankle  or  having  leukemia  has  different 
costs;  additionally,  incorrectly  diagnosing  an  encephalitis  case  as  a 
broken  ankle  or  leukemia  has  yet  different  costs. 

An  alternative  to  tne  straight  proportion  of  correct  classification 
criterion  of  accuracy  is  one  which  allows  for  differential  costs  of 
miscl assi fi cati on.  In  particular,  we  might  consider  variations  upon 
either  of  two  different  costing  models  for  misclassi fication : 

1.  That  a cost  of  0 be  assigned  to  any  misclassi  fication  of  a 
work  group  into  a category  which  calls  for  (a)  treatment 
if  so  aoes  the  correct  category,  or  (b)  non-treatment  if 
so  does  the  correct  category;  otherwise  the  cost  is  1. 

2.  Tnat  a cost  of  0 be  assigned  to  any  misclassification  of 
work  group  into  a category  which  calls  for  a treatment  which 
is  known  to  have  a positive  effect  on  the  correct  category, 
and  a cost  of  1 if  the  incorrectly  selected  category  calls 

for  a treatment  which  is  known  to  have  no  effect,  or  a negative 
effect,  on  the  correct  category. 
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These  are  only  two  models  of  a vast  number  of  possibilities  for  costing 
misclassi fication.  However,  they  are  reflective  of  the  primary  concerns 
of  misclassification  --  the  application  of  inappropriate  or  harmful 
treatments.  In  the  present  instance,  we  have  chosen  to  use  a variation 
upon  the  second  form  of  costing  --  one  which  takes  both  direction  and 
severity  into  account. 

Bowers  and  Hausser  (1977)  examined  the  effects  of  different  change 
strategies  on  each  of  the  17  groups  of  the  typology.  They  rated  the 
effects  either  as  negative,  neutral,  or  positive.  For  number  1 above, 
it  is  possible  to  define  a work  group  calling  for  treatment.  Otherwise, 
the  work  group  calls  for  no  treatment.  The  results  of  Bowers  and  Hausser 
(1977)  can  also  be  used  to  establish  the  appropriate  pattern  of  costs  for 
number  2 above. 

Non-Accuracy  Based  Criteria 

Information  Required  to  Make  a Decision 

One  criterion  by  which  one  can  compare  diagnostic  techniques  is  the 
information  reauired  to  reach  a diagnosis.  If  two  techniques  perform 
equally  well  when  the  full  set  of  indexes  is  used  in  the  diagnosis,  then 
it  will  be  to  the  diagnostician's  advantage  to  use  the  one  which  requi -“es 
less  information  to  reach  a decision.  Collecting  and  processing  information 
is  costly,  in  terms  of  money,  time,  and  complexity  of  processing.  For 
instance,  if  one  can  obtain  results  using  three  pieces  of  information 
that  are  as  good  as  the  results  using  five  pieces  of  information.  It  will 
clearly  be  advantageous  to  use  only  three  pieces  of  information. 

The  most  obvious  way  of  reducing  the  amount  of  required  information 
is  to  find  an  appropriate  way  to  reduce  the  number  of  indexes  that  are 
used  In  the  diagnostic  process.  There  are  two  ways  of  reducing  the  number 
of  indexes.  The  first  is  to  discard  or  eliminate  indexes  that  have  been 
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shown  to  be  unnecessary  for  the  diagnosis.  The  second  is  to  combine 
several  indexes  into  "super-indexes."  For  instance,  it  might  be  possible 
to  combine  the  four  peer  leadership  indexes  into  a single  index. 

Another  method  of  reducing  the  information  required  to  make  a decision 
is  to  reduce  the  number  of  respondents  proportionately  from  an  organiza- 
tional survey.  To  investigate  this  approach,  we  propose  co  draw  two 
random  samples  (67%  and  33%)  of  individuals  from  the  total  data  set  and 
classify  the  "reduced"  work  groups.  Evaluation  of  the  effect  of  this 
reduction  will  be  the  proportion  of  agreement  in  classification  between 
the  use  of  reduced  data  and  full  data. 

Amount  of  Data  Required,  to  Develop  the  Diagnostic  Process 

Of  interest  to  the  researcher  is  not  only  the  amount  of  data  that  must 
be  processed  to  obtain  a diagnosis,  but  also  the  amount  of  data  required 
to  generate  accurate  diagnostic  processes. 

All  of  the  techniques  to  be  tested  require  some  kind  of  historical 
data  base,  from  which  the  diagnostic  process  is  generated.  In  all 
instances,  larger  data  bases  should  provide  more  accuracy  than  smaller 
ones.  If  one  technique  can  be  generated  quite  accurately  from  a smaller 
data  base  than  another,  that  technique  would  be  preferable,  since  it  would 
be  more  cost  effective. 

The  amount  of  data  necessary  for  the  techniques  will  be  observed 
variously,  ranging  from  sample  sizes  necessary  to  stabilize  estimates  to 
the  general Izabil ity  of  procedures  computed  from  a developmental  sub- 
sample to  the  remaining  sample. 
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Ease  of  Calculation 

The  ideal  diagnosis  technique  is  not  only  accurate,  but  relatively 
simple  to  perform.  In  the  organizational  diagnosis  situation,  the 
diagnostician  may  very  well  be  a cr.ange  agent  who  must  make  treatment 
recommendations  while  on  site  or  otherwise  out  of  contact  with  computer 
faci li ties. 

Three  ease-of-calculation  factors  are: 

1.  Can  be  calculated  on-site  versus  in  a central 
1 ocati on . 

2.  Can  be  done  with  a hand  calculator  or  by  hand  versus 
requiring  EDP  facilities. 

3.  Few  things  to  be  calculated  versus  many  things  to  be  calculated. 

While  not  identical,  these  three  factors  are  obviously  not  completely 
independent  of  one  another. 

The  comparison  of  diagnostic  techniques  on  these  factors  will  be  a 
matter  of  subjective  judgment.  It  is  proposed  that  after  the  optimal 
solutions  have  been  determined  for  each  diagnosis  technique,  the  researchers 
create  a table  indicating  their  assessment  of  each  technique  on  the  ease 
of  calculation  factors.  The  diagnostician  will  then  be  able  to  select 
among  the  techniques  v/hen  ease  cf  calculation  is  an  important  aspect  of 
a project. 
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A Restatement  of  the  Research  Questions 

From  the  concepts  and  discussion  presented  in  the  preceding  pages, 
we  propose  to  examine  findings  germane  to  the  following  questions: 

(1)  Is  there  evidence  that  the  original  typology  holds  up  when 
applied  to  the  entire  population  of  groups;  that  is,  does  it 
appear  to  reflect  reality? 

(2)  If  the  typology  holds  up,  how  effective  is  a distance 
function  in  classifying  by  type? 

(3)  Using  the  distance  function  classification  as  the  criterion, 
how  accurate  are  discriminant  function,  decision  tree,  and 
Bayesian  methods  in  classifying  groups? 

3a.  What  proportion  of  cases  does  each  method  classify 
correctly?  (Proportion  of  correct  classification) 

3b.  How  well  does  each  method  do  in  reproducing  the 

typology;  that  is,  how  close  is  the  vector  of  scores 
for  assigned  groups  to  tne  vector  of  scores  which 
represent  the  type?  (Typology  reproduction) 

3c.  How  do  the  methods  differ  in  number  of  dimensions  and 
which  scores  ire  significantly  different  from  the 
type  values?  (Zero-one  count) 

3d.  How  do  the  methods  differ  in  the  cost  lines  (in  change 
outcomes)  of  misclassifyir.g  groups?  (Severity  of 
misclasslfication) 

(4)  To  what  extent  are  the  methods  abie  to  classify  groups  correctly 
using  reduced  information  sets? 

4a.  Can  data  from  samples  of  respondents  be  used  equally 

well  as  a complete  census  of  group  members  in  classifying 
groups? 
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4b.  Can  less  than  all  indexes  be  used  equally  well  as 
all  indexes  in  classifying  groups? 

(5)  Do  the  methods  differ  with  respect  to  the  amount  of  data 
required  to  develop  the  method? 


(6)  Do  the  methods  differ  in  ease  of  calculation? 
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CLASSIFICATION  TECHNIQUES 

The  original  development  of  the  typology  used  in  the  present  research 
is  reported  by  Bowers  (1975)  and  Bowers  and  Hausser  (1977).  In  effect,  using 
a clustering  algorithm  and  three  different  samples  of  work  groups  (two 
civilian  and  one  military),  they  found  that  17  different  types  resulted. 

The  analysis  was  done  using  the  same  14  S00  indexes  being  used  in  the  present 
research.  The  17  types  that  comprise  the  typology  can  be  thought  of  as 
17  points  in  (14-oimensional ) space. 

The  classification  issue  is  one  of  assignment  of  new  work  groups  to 
these  different  points  (types).  Four  different  procedures  historically 
have  been  used  in  a variety  of  classification  applications.  They  are  briefly 
described  below: 

(1)  Distance  function:  Find  which  type  a new  unit  is  closest  to 
with  respect  to  some  metric  (usually  the  typical  Euclidean 
metric). 

(2)  Decision  tree:  By  asking  a pre-planned  series  of  questions, 
whicn,  depending  on  the  pattern  of  responses,  leads  to  a 
final  state  or  type  within  the  typology. 

(3)  Discriminant  function:  From  a developmental  sample  one  com- 
putes a set  of  weights  to  be  assigned  to  indexes,  resulting  in 
an  equation. for  each  type  in  the  typology.-  By  applying  all 
equations  to  the  score  values  of  a new  unit,  the  unit  is  classi- 

t 

fied  as  belonging  to  that  type  for  which  the  resulting  equation 
value  is  largest. 
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(4)  Bayesian:  Based  on  the  development  of  prior  probabilities  of 
different  score  patterns  for  different  types,  posterior 
probabilities  are  calculated  from  the  score  values  of  a new  unit 
for  each  type  in  the  typology.  The  type  for  which  this  posterior 
probability  is  greatest  is  the  type  to  which  the  new  unit  is 
assigned. 

Research  into  the  effectiveness  of  these  different  procedures  for  the 
purposes  of  medical  diagnosis  is  mixed.  As  indicated  earlier,  different 
studies  have  claimed  success  for  each  of  discriminant  function,  decision 
tree  and  Bayesian  approaches. 

The  present  research  is  to  investigate  the  applicability  of  the 
above  techniques  to  the  field  of  organizational  diagnosis.  The  purpose  of 
this  section  is  to  outline  the  tecnmques  only  as  they  were  applied  herein. 

Later  sections  will  discuss  the  effectiveness  of  the  various  techniques. 

Distance  function 

The  distance  function  classification  procedure  first  computes  the  distance 

between  a new  work  group's  Index  scores  and  each  of  the  17  types.  Tne  work 

group  is  then  assigned  to  the  type  to  which  it  is  closest.  Notationai ly , 

let  ( x i ,x2,.. . ,xm)  be  the  index  scores  for  a new  work  group.  Let  (m.i  m.2,..., 

J » J 

m,m)  be  the  vector  of  values  for  the  jth  type  in  the  typology.  The 

J 

(Euclidean)  distance  from  the  work  group  to  the  jth  type  is  given  by 

D 

The  application  of  this  technique: 

. does  not  Involve  a developmental  sample, 

. Involves  nearly  5C3  calculations,  implying  the  assistance 
of  some  type  of  computer  or  other  programmable  calculating 
equi pment, 
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. does  not  require  a "correct"  classi fication  in  its 
development. 

The  existence  of  a "correct"  classification  is  an  issue  which  makes 
organizational  diagnosis,  in  its  present  state,  different  from  medical  or 
other  diagnosis/classification.  In  these  latter  applications,  either  time 
(such  as  in  predicting  success  or  failure)  or  a board  of  experts  (such 
a group  of  trained  clinicians  deciding  a patient  is  schizophrenic)  create 
the  "correct"  classification.  The  result  is  that  most  numerical  techniques 
are  then  judged  by  their  ability  to  replicate  these  decisions,  that  is, 
classify  into  the  "correct"  category. 

In  the  present  situation,  there  is,  strictly  speaking,  no  externa1 
"correct"  classification  available.  The  investigation  of  the  numerical 
classification  techniques,  then,  is  not  to  compare  them  to  the  "correct" 
classification,  but  to  examine  their  performance  relative  to  each  other. 

The  option  is  to  take  one  of  the  techniques  and  compare  the  others  with 
it.  The  selection  of  this  single  technique  was  made  easier  in  the  present 
situation  by  the  methodology  used  to  develop  the  typology.  The  underlying 
mathematical  approach  was  to  cluster  those  work  groups  which  were  similar 
to  each  other,  their  similarity  being  judged  by  their  Euclidean  distance. 
Thus,  the  typology  itself,  was  based  on  grouping  by  a distance  criterion. 
Thus,  for  this  study,  a new  work  group's  "correct"  classification  was 
considered  to  be  that  which  it  was  assigned  by  the  distance  function. 
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Decision  tree 

The  approach  we  have  labeled  as  decision  tree  is  somewhat  different 
operationally  from  the  classical  application  of  decision  tree  techniques. 
In  the  latter,  a series  of  questions  Is  asked  sequentially,  the  answer 
to  one  indicating  which  question  is  asked  next.  Pictorially,  the  tree 
effect  is  demonstrated  below. 


Figure  2 


Qi  = Question 

E.  = End  state 
J 


In  the  present  application,  a series  of  questions  was  asked  with  the  answer 
to  one  not  affecting  which  question  was  to  be  asked  next.  In  essence,  a 
series  of  yes/no  questions  were  asked,  and  the  number  of  favorable  answers 
recorded.  The  history  of  this  approach  Is  discussed  more  fully  by  Wheeler 
(1978)  and  Bowers,  et  al  (1977). 
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In  particular,  the  approach  herein  can  be  visually  represented  as 
follows : 


Figure  3 


Index 

Type  7 Intervals 

1 

(2.8309,4.0243) 

2 

(2.8768,3.9558) 

3 

(2.2041 ,3.2587) 

4 

(2.3448,3.4100) 

5 

(3.0014,4.0986) 

6 

(2.4958,3.5316) 

7 

(2.3023,3.3471  ) 

8 

(2.3070,3.3290) 

9 

(2.1423,3.1649) 

10 

(2.0692,3.0188) 

11 

(2.4115,3.3771  ) 

12 

(1 .9535,2.8609) 

13 

(2.8355,3.8083) 

14 

(1.5535,2.6925) 

In  the  above  figure,  a new  work  group's  values  for  the  14  indexes  are 
compared  to  the  intervals  given  for  Type  7.  The  number  of  intervals  which 
contain  the  new  index  values  are  recorded,  and  that  becomes  the  fit  for 
Type  7.  The  fit  for  each  of  17  possible  types  is  computed,  and  then  the 
maximum  of  these  fit  values  is  ascertained.  The  classification  rule  is  to 
place  the  new  work  group  in  that  type  for  which  the  fit  is  the  largest. 

The  results  of  this  approach  are  presented  in  the  next  chapter. 

It  is  possible,  it  should  be  noted,  to  insert  the  above  form  into  the 
classical  decision  tree  model  applied  to  each  type.  The  technique,  then, 
is  the  application  of  17  decision  trees.  If,  in  Figure  2,  Question  1 (Qi ) 
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was  whether  the  new  work  group's  first  Index  falls  in  the  appropriate  range 

i 

for  the  first  index,  and  both  Q?  and  Q3  were  to  ask  if  the  second  index  is 
in  the  range  for  the  second  index,  Q4  through  Q„  would  ask  the  same  for  the 
third  index,  etc.  down  through  the  14th  index,  then  the  End  state  (E.) 
would  be  the  fit  for  that  work  group  for  that  type. 

The  last  feature  of  Implementing  this  technique  has  to  do  with  the 
selection  of  the  intervals.  That  is,  for  each  of  the  17  types  and  14 
indexes,  intervals  were  constructed.  While  a variety  of  procedures  might 
be  used,  some  of  which  may  not  be  data  based,  it  was  chosen  to  use  symmetric 
intervals  about  the  ideal  value.  The  distance  about  the  ideal  value  was 
taken  to  be  a certain  number  of  standard  deviation  units.  For  example, 
for  Type  7,  index  #1,  the  ideal  value  is  3.4276,  and  1.5  standard  deviation 
units  is  .5967,  yielding  an  interval  of  (2.8309,4.0243)  (see  Figure  3). 

The  ideal  values  were  taken  to  De  the  index  means  of  the  groups  of  each  type 
resulting  from  the  original  development  of  the  typology. 

To  apply  this  form  of  the  decision  tree  approach,  it 

. does  require  the  establishment  of  intervals,  most  likely 
from  a developmental  sample,  wnich  have  been  "correctly" 
classified, 

. ooes  not  require  involved  mathematical  calculations  in 
its  assignment  of  new  work  groups,  thereby  permitting 
either  hand  or  computer  use. 

Discriminant  Function 

The  discriminant  function  approach  is  a data  based  classification 
procedure.  That  is,  from  a developmental  sample,  a set  of  linear  (or 
quadratic)  functions  of  the  predictor  variables  are  generated  which 
maximally  differentiate  among  the  groups  of  the  typology. 
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To  apply  the  discriminant  function  technique,  it  is  assumed  that  one 
has  a set  of  predictor  variables  that  are  not  highly  interdependent 
(multicol 1 inear) . This  is  exacerbated  by  the  fact  that  this  requirement 
is  to  be  met  wi t hi n the  subgroups  that  represent  the  various  classes  of 
the  typology.  In  the  present  work,  interdependence  between  the  14  indexes 
(predictor  variables)  was  sufficiently  high  that  it  precluded  the  calculation 
of  the  discriminant  functions. 

The  option  of  reducing  the  number  of  predictors  was  explored.  As  may 
be  expected,  the  highest  levels  of  interdependence  were  found  within  the 
three  major  categories  of  indexes:  organizational  climate,  supervisory 
leadership  and  peer  leadership.  The  approach  taken  to  reduce  the  number 
of  indexes  was  to  do  a cluster  analysis  on  the  14  indexes,  clustering  by 
the  degree  of  correlation.  The  indexes  clustered  within  the  three  areas  just 
mentioned.  However,  for  example,  without  taking  all  four  peer  leadership 
indexes  into  a cluster,  those  left  out  wculd  still  correlate  quite  strongly 
with  the  cluster  (.7  to  .85).  Thus  the  decision  was  made  to  form  three 
super-indexes : 

. Organizational  Climate  (Decision  Making  Practices,  Communication 
Flow,  Motivational  Conditions,  Lower  Level  Influence,  Human 
Resources  Primacy). 

. Supervisory  Leadership  (Support,  Work  Facilitation,  Team 
Building  and  Goal  Emphasis). 

. Peer  Leadership  (Support,  Work  Facilitation,  Team  Building 
and  Goal  Emphasis). 
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Thus,  all  discriminant  function  classification  is  based  on  the  three  super- 
indexes, each  being  the  sum  of  its  component  indexes. 

The  total  data  base  available  for  this  project  was  divided  randomly 
into  two  equal  subsets.  These  samples  contained  over  3,000  work  groups 
each,  all  of  which  had  previously  been  assigned  a "correct"  classification 
by  the  distance  function  algorithm.  From  the  developmental  sample, 
discriminant  function  weights  were  calculated.  The  application  of  these 
resulting  discriminant  functions  to  the  test  sample  is  described  in  a later 
section. 

The  application  of  this  technique 

. does  require  a fairly  large  developmental  sample  with 
use  in  a typology  with  numerous  types, 

. does  require  prior  "correct"  classification  of  the 
developmental  sample, 

. does  require  extensive  calculations  to  develop  the 
discriminant  functions, 

. does  require  involved  calculations  to  assign  a new 
work  group,  indicating  the  use  of  a computer. 

Bayesian  Method 

Principally,  the  Bayesian  approach  involves  the  establishment  of  a 
set  of  prior  probabilities  and  likelihood  functions,  the  calculation,  for 
a new  work  group's  actual  scores,  a set  of  posterior  probabilities,  and  the 
assignment  of  that  work  group  to  the  type  for  which  the  posterior  probability 
is  the  largest.  A more  detailed  explanation  for  the  consideration  of  this 
approach  is  included  in  Bowers,  et  al  (1977). 
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For  the  present  work,  the  development  of  the  prior  probabilities  was 
simply  the  relative  frequencies  of  each  type  in  the  entire  data  set. 

As  discussed  in  Bowers,  et  al  (1977),  the  development  of  the  likelihood 
functions,  which  are  used  in  the  calculations  of  the  posterior  probabilities, 
is  pivotal.  The  family  of  distributions  used  to  develop  the  likelihood 
functions  is  critical.  The  multivariate  normal  family,  while  mathematically 
convenient,  has  the  shortcoming  that  it  permits  arbitrarily  large  or  small 
values  for  its  observations.  The  values  obtained  from  the  work  groups  are 
means  from  five  point  scales,  and  thus  are  bounded.  The  multivariate  normal 
family  is  also  symmetrical  in  each  dimension.  For  the  same  types  in  the 
typology  which  have  relative  large  index  means,  say  near  4.7,  and  say 
a standard  deviation  of  .3,  it  is  impossible  to  be  more  than  one  standard 
deviation  above  the  mean  (the  maximum  score  being  five),  and  some  values  do 
fall  below  4.4,  resulting  in  asymmetrical  distributions.  For  these  reasons, 
the  multivariate  normal  distribution  was  omitted  from  further  consideration. 

Another  family  of  distributions  which  was  investigated  was  the  beta 
family.  This  family  has  density  functions  of  the  form 

<’> 

for  o<y<l , and  a>o,B>o.  The  different  members  of  the  family  (for  various 
combination  of  a and  B),  are  not  necessarily  symmetrical,  and  have  bounded 
range  (o.l).  A simple  conversion  of  a work  group  score,  say  x,  by 
y = (x-l)/4  results  in  values  in  the  appropriate  range. 

In  order  to  fit  a member  of  the  beta  family  to  an  empirical  distribution, 
one  can  estimate  the  parameters  a and  B by  the  following: 

(2)  a = ^tlM(1-M)-S2] 

B - ^[M(l-M)-S2] 


(3) 
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where  M = 1/4(Y-1)  and  S2  = j-gV,  X and  V being  the  mean  and  variance  of 
an  index  on  the  five  point  scale.  These  equations  result  from  the  fact 
that  the  mean  and  variance  of  the  beta  distribution  are  given  by 

V = a/(a+0), 

a2  = a0/[(ct+0+l  )(a+0)2]. 


The  procedure  for  computing  the  posterior  probabilities  for  a new  work 
group,  given  the  likelihood  functions,  is  as  follows: 

#1  From  among  the  profiles  of  types  1 through  17,  select  one 
to  serve  as  a standard,  say  #3. 

#2  Let  Q.  represent  type  j,  for  j=l,...,17.  Then  let  P(Q.)  be 
J J 

the  relative  frequency  of  type  j (based  on  the  distance  function 
classification  of  all  work  groups  in  the  sample). 

#3  Let  a new  work  group  have  a vector  of  converted  index  scores 
given  by  I = (y 1 4) , where  y^  = [(X^-lJ/4,  X^  being 

1 L 

the  1 actual  index  score. 

<4  Now  compute,  for  j=l,...,17,  the  value  of  the  odds-1 i kel i hood 


ratio 

(4) 


P(Qj 1 1 ) 
p(Q3| I) 


IlilV 

fi 3 (yi ) 


P(QJ 
p(q3  ) 


where  f.^  is  the  estimated  density  function,  given  in  (1),  for 
the  1^  index  within  the  j**1  type.  Note  that  a and  0 also  change 
for  each  1 and  j.  Implicit  at  this  point  is  the  prior  calcula- 
tion of  the  estimated  parameters  a and  0 for  all  14  (indexes)  x 
17  (types)  cases. 
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# 5 Now  compute  P( Qa | I ) by 

P(Qa| I)  = 


17  P(Q  . 1 1 ) 


y=i  p(Q3|i). 

#6  Returning  to  equation  (4),  it  is  now  possible  to  compute  the 


posterior  probabilities  P(Qj|I)  by 


(5) 


P(Qjil)  = 


P(Qi  1 1 )" 

— v 

P(Qj|I) 


p(Q3|i). 


Again  note  that  the  selection  of  profile  3 was  arbitrary,  and 
any  of  the  17  types  could  have  been  selected. 

#7  Find  the  value  of  j for  which  the  posterior  probability, 
P(Qj|l)>  j=l  ,2,... ,17,  is  largest  and  assign  the  new  work 
group  to  that  type. 

#8  Repeat  steps  #3  through  #7  for  all  work  groups  that  are  to  be 
classi fied. 


It  is  important  to  note  here  that  this  approach  assumes,  at  step  4, 
that  the  indexes  are  independent.  That  is,  the  joint  (multivariate)  density 
of  all  14  indexes  can  be  equated  to 

14 

f(yi,y2 yiJ  = 1 1 fi (y,- ) 

i=l 

only  when  the  indexes  (y^'s)  are  independent.  In  the  present  case,  they 
are  not.  As  suggested  elsewhere  (Bowers,  et  al . 1977,  pg.  57),  one  can 
consider  grouping  the  indexes  into  relatively  independent  clusters. 
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The  application  of  the  Bayesian  approach 

. does  require  a fairly  large  developmental  sample, 

. does  require  prior  "correct"  classification  of  the 
developmental  sample, 

. does  require  a relatively  independent  set  --  or 
collection  of  relatively  independent  subsets  -- 
of  predictors, 

. does  require  extensive  calculations,  thus  computer 
resources,  for  both  developmental  and  application 
to  new  work  groups. 


The  results  of  the  application  of  this  Bayesian  approach  are  presented 
in  the  results  section. 
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BASIC  PREPARATION  OF  THE  DATA 

The  data  used  in  this  study  represent  organizational  measures  on  both 
military  and  civilian  units.  The  measures  of  behavior  within  organizational 
life  are  comparable  across  both  settings.  The  purpose  of  this  section  is 
to  describe  the  data  set  used  for  this  study. 

The  existing  national  (civilian)  normative  file  of  the  Survey  of 
Organizations  (S00)  contains  5,994  groups.  It  represents  the  total  body  of 
data  collected  since  1966  from  some  36,607  persons  and  137  organizations 
in  a broad  segment  of  the  civilian  industrial  population.  As  such,  it 
represents  many  different  industries,  functions,  and  hierarchical  levels. 

Available  also  are  data  from  two  independent  military  samples.  The 
first  of  these  contains  more  the  787  usable  groups  of  Navymen  from  whom 
questionnaire  data  on  $00  indexes  were  collected  in  late  1972  and  early 
1973.  The  second  contains  668  groups  of  Army  soldiers  from  whom  data  were 
collected  in  late  1974  and  early  1975.  In  each  of  these  instances,  in 
order  to  satisfy  the  need  for  intact  units,  it  was  decided  to  collect  data 
from  all  members  of  a selected  number  of  organizational  subunits  or  "nodules. 
These  modules  consisted  of  a pyramid  of  work  groups  three  echelons,  or  tiers, 
tall.  Thus  data  were  collected  from  all  members  of  the  three  organizational 
levels  immediately  below  a designated  "module  head."  Modules  themselves  were 
selected  by  what  amounts  to  a stratified  random  sampling  procedure.  Methods 
are  spelled  out  in  greater  detail  in  two  technical  reports  (Michaelsen,  1973; 
Spencer,  1975), 


Taken  together,  these  various  data  sets  comprise  a sample  of  nroups,  to 
be  employed  in  the  main  analyses.  From  the  onset  of  the  project  to  the 
present  time,  these  various  data  sets  have  been  reformated  so  that  all 
share  a common  format.  All  data  have  been  entered  into  a single  large 
file. 


Measures  Used 


The  Survey  of  Organizations  contains  in  its  1974  edition  16  standard 
indexes.  Two  of  these,  because  they  have  not  been  universally  used  since 
the  start  of  the  data  bank,  will  be  dropped.  The  14  which  remain  will 
form  the  survey  index  measures  to  be  used  in  the  present  study: 

Organizational  Climate 

Decision  Making  Practices  — the  manner  in  which  decisions 
are  made  in  the  system:  whether  they  are  made  effectively, 
made  at  the  right  level,  and  based  upon  all  of  the 
available  information  (4  item  index). ; 

Communication  Flow  --  the  extent  to  which  information  flows 
freely  in  all  directions  (upward,  downward,  and  laterally) 
through  the  organization  (3  item  index). 

Motivational  Conditions  — the  extent  to  which  conditions 
(people,  policies,  and  procedures)  in  the  organization 
encourage  or  discourage  effective  work  (3  item  index). 


Human  Resources  Primacy  — the  extent  to  which  the  climate, 
as  reflected  in  the  organization's  practices,  is  one  which 
asserts  that  people  are  among  the  organization's  most 
important  assets  (3  item  index). 

Lower  Level  Influence  — the  extent  to  which  non-supervisory 
personnel  and  first-line  supervisors  influence  the  course  of 
events  in  their  work  areas  (2  item  index). 
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Supervisory  Leadership 

Supervisory  Support  --  the  behavior  of  a supervisor  toward 
a subordinate  which  serves  to  increase  the  subordinate's 
feeling  of  personal  worth  (3  item  index) 

Supervisory  Team  Building  — behavior  which  encourages 
subordinates  to  develop  mutually  satisfying  interpersonal 
relationships  (2  item  index). 

Supervisory  Goal  Emphasis  — behavior  which  generates 
enthusiasm  (not  pressure)  for  achieving  excellent 
performance  levels  (2  item  index) 

Supervisory  Work  Facilitation  --  behavior  on  the  part  of 
supervisors  which  removes  obstacles  which  hinder  successful 
task  completion,  or  positively,  which  provides  the  means 
riecessary  for  successful  performance  (3  item  index) 


Peer  Leadership 

Peer  Support  — behavior  of  subordinates,  directed  toward 
one  another,  which  enhances  each  member's  feeling  of 
personal  worth  (3  item  index). 

Peer  Team  Building  — behavior  of  subordinates  toward  one 
another  which  encourages  the  development  of  close,  coopera- 
tive working  relationships  (3  item  index). 

Peer  Goal  Emphasis  --  behavior  on  the  part  of  subordinates 
which  stimulates  enthusiasm  for  doing  a good  job  (2  item 
index) . 

Peer  Work  Facilitation  --  behavior  which  removes  roadblocks 
to  doing  a good  job  (3  item  index). 


Satisfaction  — a measure  of  general  satisfaction  made  up  of 
items  tapping  satisfaction  v/i th  pay,  with  the  supervisor, 
with  cc-workers  (peers),  with  the  organization,  with 
advancement  opportunities,  and  with  the  job  itself 
(7  item  index). 
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The  typology  of  work  groups  to  be  used  in  this  study  is  reported 
Bowers  and  Hausser  (1977),  contains  17  types,  and  is  based  on  the  indexes 
of  the  Survey  of  Organizations.  The  resulting  types  have  different  profiles 
across  the  indexes,  with  the  patterns  of  these  profiles  being  quite  distinct. 

The  data  set  contains  work  groups  at  a variety  of  organizational  levels. 
For  purposes  of  analysis,  the  S00  data  has  been  coded  as  representing  one 
of  the  following  five  distinct  and  exhaustive  classifications: 

Level  4 --  Top  Management.  Responses  of  employees  who  report  to 
the  upper-most  level  of  management.  Includes  responses 
from  individuals  in  vice-presidential,  major  departmental 
heads,  or  equivalent  positions  who  report  to  the  head 
of  the  organization. 

Level  3 --  Middle  Management.  Responses  from  subordinates  reporting 
to  the  top  management  positions  listed  above.  Includes 
responses  of  assistant  department  heads  or  general  super- 
intendents (those  who  head  up  sub-departments  within  major 
departments),  technical/professional  employees  (with  at 
least  two  subordinate  levels  below  them),,  assistant 
controllers,  etc. 

Level  2 — Second-line  Supervision.  Responses'of  foremen  or  equi- 
valent positions  about  their  supervisors  (e.g.,  general 
foremen).  Personnel  responding  here  may  include  any 
supervisory  personnel  from  the  lowest  (or  two  lowest, 
depending  on  size  of  organization)  levels. 

Level  1 — 1st  line  Supervision  (blue-collar).  Responses  of  non- 
supervisory  employees  about  their  supervisors  (e.g., 
foremen).  Includes  hourly  or  equivalent  line  workers. 

Level  0 --  1st  line  Supervision  (white-collar).  Responses  of  non- 
supervisory  employees  about  their  supervisors  (e.g., 
clerical  supervisors  in  accounting,  personnel,  etc.). 

May  also  include  non-supervisory  professional  or  technical 
employees. 

An  analysis  of  the  present  data  to  reflect  the  relative  distribution 
of  the  various  levels  is  contained  in  Table  1. 

Another  factor  which  distinguishes  among  work  groups  is  work  group 
size.  The  frequency  distribution  of  work  group  size  is  contained  in 
Table  2.  The  existence  of  the  very  small  (N=l)  and  large  work  groups  are 


TABLE  1 


DISTRIBUTION  OF  WORK  GROUPS  BY  LEVEL 


0 

1 

Level 

2 

3 

4 

Number  of 
Groups 
Classified 

35.0% 

22.6% 

27.1% 

12.5% 

2.8% 

All  work  groups 

1 ,788 

1 ,157 

1 ,385 

640 

141 

5,111* 

Work  Groups 

31.9% 

23.1% 

28.3% 

13.7% 

3.1% 

Size<2<N  40 

1 ,413 

1 ,021 

1 ,250 

604 

135 

4,423** 

*2,338  work  groups  have  no  level  coding 

**Of  the  6,032  work  groups  with  size  between  2 and  40,  1,609  have  no 
level  coding. 
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evident.  The  existence  of  work  groups  with  N=1  most  likely  reflects 
situations  where  (a)  other  members  of  a small  work  group  were  absent, 

(b)  there  occurs  over-specifying  supervisory  reporting  relationships  beyond 
those  that  really  exist,  (c)  specialized  survey  administrations  (e.g.,  to 
salaried  employees  only),  (d)  the  use  of  samples  of  employees,  or 
(e)  miscoding/non-codings  of  work  group  identification  numbers.  Large  work 
groups  may  result  from  certain  manufacturing  settings  where  a supervisor 
is  directly  responsible  for  many  production  people,  or  where,  for  whatever 
reason,  middle  levels  of  supervision  were  not  identified. 

As  a partial  response  to  the  existence  of  very  large  or  small  work  groups 
in  the  data  set,  some  analyses  are  done  for  only  those  work  groups  with 
size  between  2 and  40,  inclusive.  As  shown  in  Table  3,  there  are  6,032 
work  groups  whose  size  is  between  2 and  40.  The  existence  of  samples 
of  employees  were  especially  significant  in  some  of  the  military  applications 
of  the  survey  administration.  In  fact,  32 % of  Army  work  groups  and  38.4% 
of  the  Navy's  were  of  size  one,  where  14.7%  of  the  civilian  sample  had  work 
group  N equal  to  one.  The  resulting  6,032  work  groups  with  size  2 through 
40  are  thus  made  up  of  5,095  civilian,  452  Army  and  485  Navy  units. 

In  summary,  there  are  a total  of  7,449  work  groups,  of  which  6,944  have 
complete  information  for  purposes  of  classification.  There  are  6,032 
work  groups  with  size  two  through  40.  Of  these,  5,651  have  complete 
information.  For  purposes  of  the  present  analysis,  the  results  presented 
will  be  for  the  6,944  work  groups  and/or  the  5,651  work  groups. 
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RESULTS  OF  ANALYSES 


Previous  sections  of  this  report  have  described  the  background  and  history 
of  the  diagnosis/ciassification  problem,  as  well  as  approaches  toward 
solution.  In  this  section,  numerical  results  of  the  various  approaches 
under  investigation  are  presented,  as  well  as  some  discussion  of  their 
relative  merits.  The  basic  results  of  the  three  approaches  still  under 
consideration  are  presented  first  individually,  and  then  a relative  comparison 
between  them. 

Distance  Function 

The  first  step  was  to  assign  all  work  groups  in  the  data  set  to  one  of 
the  17  types  of  the  typology  according  to  which  type  the  work  was  closest 
to  in  the  Euclidean  sense.  In  order  to  do  this,  it  was  necessary  for  all 
14  indexes  be  available  for  a work  group.  In  some  instances,  due  to  the 
form  of  the  S00  used  and/or,  most  frequently,  some  items  not  being  answered, 
all  indexes  were  not  always  available.  Table  4 presents  the  results  of 
classifying  all  work  groups  with  complete  data  into  the  17  types  of  the 
typology.  Table  4 also  contains  the  same  information  for  those  work  groups 
for  which  the  work  group  size  is  between  two  and  40.  The  differences  between 
the  pattern  for  all  work  groups  and  those  in  the  two  to  40  range  are  also 
presented.  The  largest  difference  is  in  Type  1,  which  is  the  highest  overall 
profile  of  scores.  This  difference  might  be  expected;  i.e.,  for  those  real 
work  groups  with  only  one  subordinate,  the  relationship  between  subordinate 
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and  manager  would  tend  to  be  stronger,  have  greater  information  flow  and 
participation  in  decision  making,  etc.  The  actual  distribution  of  1,290 
classifiable  work  groups  of  size  N=1  or  N>40  is  given  in  Table  5. 

Two  other  distributions  are  of  interest.  The  first  is  the  distribution  of 
types  by  level  of  the  work  group  within  the  organization.  The  results  of 
this  analysis  are  presented  in  Table  6.  The  general  pattern  for  this  table 
is  a higher  frequency  in  the  low  numbered  types  for  higher  level  management 
groups.  This,  too,  is  as  expected.  The  higher  score  patterns  (types)  among 
higher  level  managerial  groups  have  long  been  known  to  users  of  the  S00,  and 
are  acknowledged  by  the  use  of  different  norm  sets  (conversions  from  raw 
scores  to  percentiles)  for  each  level. 

The  second  analysis  of  interest  is  a comparison  of  types  by  the  civilian/ 
military  breakdown.  This  analysis  is  reported  in  Table  7.  The  notable 
feature  here,  other  than  the  clear  difference  between  military  and  civilian 
patterns,  is  the  similarity  between  the  Navy  and  Army  patterns.  There  are 
more  high,  straight  line  profiles  in  the  civilian  data,  more  low  straight 
line  and  supervisor-divergent  profiles  in  the  military  sample  (see  Bowers, 

1975;  Bowers  & Hausser,  1977). 

Decision  Tree 

The  Decision  Tree  algorithm  used  in  the  present  research  was  described 
in  a previous  section  of  this  report.  The  actual  implementation  issues  and 
results  are  reported  herein.  The  two  major  implementation  issues  are 
described  first. 

When  computing  the  fit  of  a new  work  group  to  the  17  types,  the  possible 
scores  fall  in  the  15  point  range  from  zero  (no  index  score  falls  in  any  of 
the  corresponding  intervals)  to  14  (all  14  index  scores  fall  in  their 
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corresponding  intervals).  The  difficulty  with  this  fact  is  the  possibility 
of  ties,  in  particular  ties  for  the  best  fit.  If,  for  example,  the  17  fit 
scores  for  a particular  work  group  happen  to  be  (7,  3,  6,  8,  11,  9,  6,  8,  6, 

9,  10,  11,  6,  5,  3,  7,  9),  respecti vely,  then  there  is  a tie  between  types 
5 and  12.  Conceptually,  it  would  be  possible  for  all  types  to  tie.  In 
actuality,  for  the  intervals  investigated,  the  ties  are  most  frequently 
among  two  to  four  types.  The  classiciation  problem  is,  "How  do  you  assign 
work  groups,  whose  best  fit  according  to  the  decision  tree  algorithm,  is 
tied  among  two  or  more  types?" 

Two  approaches  have  been  taken  with  respect  to  the  results  presented 
below.  One  approach  is  not  to  classify  a group  which  has  its  best  fit  tied 
among  two  or  more  types.  A second  approach,  knowing  the  correct  classification, 
is  to  consider  the  classification  to  be  correct  if,  for  those  work  groups  with 
best  fit  ties  among  two  or  more  types,  the  correct  type  is  among  those  types 
tied  for  best  fit. 

A second  implementation  issue  is  the  selection  of  the  intervals  used  in 
the  algorithm.  In  the  present  work,  while  it  had  been  decided  to  go  with 
symmetric  intervals  about  the  ideal  value  (mean  of  groups  for  a particular 
type  used  in  developing  the  typology),  the  optimum  width  of  such  intervals 
was  unknown.  The  choice  was  to  inspect  a variety  of  intervals  widths  and 
explore  their  effectiveness.  Tables  8,  9,  10,  11  and  12  show  the  frequency 
of  correct  classification  and  the  overall  percent  of  correct  classification 
for  the  five  widths,  1.4,  1.5,  1.67,  1.75  and  1.8  standard  deviations  either 
side  of  the  ideal  value.  In  these  tables,  wor’-  groups  with  tied  best  fits 
were  counted  as  correctly  classified  if  the  correct  type  was  one  of  those 
types  tied  for  best  fit. 


THE  COMPARISON  OF  CLASSIFICATION 
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NUMBER  CORRECT  4895  PERCENT.  CORRECT  ’0.5. 
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NUMBER  CORRECT  5115  PERCENT  CORRECT  73.7 
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BY  THE 
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NUMBER  CORRECT  5226  PERCENT  CORRECT 
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Inspection  of  Tables  8 through  12  show  an  increasing  percentage  accuracy 
with  wider  Intervals.  This  pattern  Is  somewhat  misleading  in  that  there  is 
a corresponding,  non-linear  (greater  than  linear)  increase  in  the  frequency 
of  work  groups  with  tied  best  fit  scores.  The  summary  of  these  results 
across  the  five  tables  can  be  seen  in  Table  13. 

In  selecting  a standard  interval  width,  the  primary  concern  would  seem 
to  be  one  of  ties.  As  shown  in  Table  13,  the  percent  accuracy  of  classifica- 
tion does  not  effectively  change  If  one  classifies  only  those  work  groups  for 
which  there  are  not  ties  for  best  type.  This  result  needs  to  be  interpreted, 
however.  In  light  of  the  realization  that  there  are  fewer  work  groups  to 
classify  with  the  larger  intervals  than  the  smaller.  Thus,  for  intervals  of 
+1.4  standard  deviations,  there  were  (6,941-2,190  with  ties  =)  4,751  work 
groups  to  classify,  whereas  with  Intervals  of  +1.8  standard  deviations,  there 
were  (6,941-3,001  with  ties  =)  3,940  work  groups  to  classify.  Thus,  the 
larger  Interval  classified  (57.5%  of  3,940  =)  2,266  correctly,  whereas  the 
smaller  classified  (56.9%  of  4,751  =)  2,705  correctly.  These  results  indicate 
selecting  one  of  the  smaller  Intervals. 

A second  concern,  however  minor,  is  the  distribution  of  the  number  of 
ties.  In  practice,  if  approximately  30  to  35%  of  work  groups  have  tied  best 
fit  scores,  the  classification  problem  is  different  if  most  work  groups  have 
only  two  types  which  tie  or,  say,  four  or  five  types  are  tied.  In  the  former, 
for  treatment  purposes,  one  might  explore  the  advantage  of  both  suggested 
treatments,  whereas  in  the  latter,  one  would  still  be  selecting  among  four 
to  five  treatments.  Table  14  demonstrates  tnat  the  number  of  tied  types  is 
principally  two,  especially  in  the  shorter  intervals. 
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TABLE  13 

ACCURACY  OF  CLASSIFICATION  BY  THE  DECISION  TREE 
(All  Work  Groups)* 


Number 

of  Standard  Deviation 
On  Either  Side  of  the 

Units  Used 
Mean 

1 .40 

1 .50 

1 .67 

1 .75 

1 .80 

Percent  Accurate 

Classi fication 

70.5 

71 .6 

73.7 

75.3 

75.9 

Percent  of  work  groups 
which  had  best  fit  in 
two  or  more  work  groups 

31  .6 

33.2 

38.1 

41  .7 

43.2 

Percent  of  work  groups 
without  ties  that  were 
accurately  classified 

56.9 

57.6 

57.5 

57.7 

57.5 

*Based  on  6941  work  groups,  regardless  of  work  group  size. 
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TABLE  14 

DISTRIBUTION  OF  NUMBER  OF  TIED  TYPES 


Interval 

About 

Mean 

t 

2 

3 

4 

Number 

5 

of  Types  That 

6 7 

Tied 

8 

N 

+ 1.40o 

70.8 

21.1 

6.2 

1.1 

0.5 

0.1 

0.0 

2190 

+ 1.50a 

69.2 

22.6 

5.6 

1.7 

0.7 

0.1 

0.0 

2302 

+ 1.67a 

67.7 

23.1 

7.1 

1 .2 

0.6 

0.1 

0.0 

2646 

+ 1.75a 

65.0 

23.7 

8.6 

1 .7 

0.7 

0.2 

0.1 

2891 

+ 1.80a 

62.3 

24.9 

9.5 

2.3 

0.7 

0.3 

0.1 

3001 
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Another  feature  of  concern  is  whether  the  ties  are  distributed  j 

proportionately  across  actual  types  or  not.  Table  15  presents  the  dis- 
tribution pattern  of  the  work  groups  with  tied  types  for  best  fit.  By 

i 

comparison  with  the  total  distribution,  one  sees  that  there  are  somewhat 

I 

more  ties  among  the  relatively  frequent  correct  types  and  fewer  ties  among 
the  more  infrequent  correct  types.  The  diagnostic  implications  are  that 

the  decision  tree  process  will  do  somewhat  better  in  identifying  the  rarer  J 

t 

types  than  the  more  common  types.  ] 

A fourth  feature  is  the  degree  of  compatability  between  a particular 

work  group's  index  values  and  the  intervals,  which  has  been  called  the  fit.  ^ 

The  analysis  of  best  fit  values,  for  only  those  work  groups  that  had  tied  j 

types,  is  presented  in  Table  16.  It  shows  that  the  average  fit  becomes  j 

higher  with  larger  intervals,  as  expected.  For  differentiation  purposes,  j 

it  also  suggests  the  use  of  smaller  intervals. 

* ] 
As  a consequence  of  the  results  presented  above,  it  was  decided  to 

select  the  intervals  given  by  +1.5  standard  deviations.  For  this  size,  the 

i 

accuracy  of  classification  was  71.6%  with  33.2%  of  the  work  groups  having  j 

tied  best  fit  types.  The  accuracy  was  57.6%  when  classifying  only  those  work 
groups  that  did  not  have  tied  best  fit  types.  In  future  references  to  thq 
decision  tree  algorithm,  the  interval  used  will  be  +1.5  standard  deviations. 

A last  feature  of  the  classification  accuracy  of  this  procedure  is  the 

I 

accuracy  with  which  it  classifies  within  each  type.  That  Is,  given  that  a 
work  group  is  of  a certain  actual  type.  Is  the  probability  of  being  correctly 
classified  the  same  as  it  Is  If  it  were  a different  actual  type?  The  results, 

* 

computed  from  the  figures  In  Table  9,  are  presented  In  Table  17.  As  can 

| 1 

be  seen,  the  percentages  of  correct  classification  are  not  uniform  across 
types.  The  greatest  difficulty  Is  for  types  3,  4,  and  5.  These  types  represent 


PERCENTAGE  DISTRIBUTION  OF  WORK  GROUPS  WITH  TIED  BEST  FIT  BY  CORRECT  TYPE 
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TABLE  16 

AVERAGE  BEST  FIT* 


Interval  About 

Mean 

Mean  Fit** 

Percent  of  Groups 
With  Maximum  Fit** 

+ 1.40o 

11 .104 

7 .1 

+ 1.50a 

11.603 

14.2 

+ 1.67a 

12.411 

32.4 

+ 1.75a 

12.658 

40.9 

+ 1.80a 

12.827 

46.9 

I 


**The  fit  is  the  number  of  indexes  for  which  the  work  group 
data  falls  within  the  designated  range;  the  maximum  is  14. 


**These  figures  are  for  those  work  groups  which  had  best  fit 
in  two  or  more  types  (i.e.,  tied  for  which  type  it  is  to 
be  classified). 


I 
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TABLE  17 

DECISION  TREE  ACCURACY  BY  TYPE 


Correct  Type 

Percent  Correct 

1 

94.4 

2 

71.1 

3 

57.5 

4 

58.5 

5 

51.5 

6 

72.6 

7 

61.0 

8 

72.5 

9 

77.8 

10 

85.7 

11 

83.3 

12 

80.6 

13 

81 .2 

14 

73.9 

15 

87.3 

16 

80.0 

17 

89.4 

Overall 

71  .6 
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SOO  score  profiles  which  fall  close  to  the  50th  percentile.  In  examini'ig 
Table  9,  one  sees  that  the  predominant  pattern  is  to  assign  these  straight 
line  types  into  the  non-straight  line  types  (the  latter  are  types  9 to  17). 
Most  of  the  divergent  profiles  begin  with  score  patterns  in  this  45th  to 
65th  percentile  range,  (the  range  covered  by  types  3,  4,  and  5)  which  may 
explain  the  difficulty  of  correctly  classifying  these  straight  line  types, 
while  not  having  nearly  the  difficulty  with  the  other  straight  line  types 
which  are  higher  or  lower. 

Discriminant  Function 

The  discriminant  function  analysis  reported  here  involves  classifying 
work  groups  on  the  basis  of  three  super  indexes:  organizational  climate, 
supervisory  leadership,  and  peer  leadership.  The  reasons  behind  the 
consolidation  of  the  regular  SOO  indexes  into  these  three  super  indexes 
were  given  in  an  earlier  section.  The  total  data  set  was  divided  into 
two  random  subsets,  each  consisting  of  3,016  work  groups  whose  size  ranged 
from  two  to  40.  The  discriminant  functions  were  generated  from  one  sample 
and  then  applied  to  the  other  sample. 

The  pattern  of  classification  of  this  procedure  is  given  in  Table  18. 
The  percentage  of  correct  classification  is  based  on  2,823  work  groups, 
(2,828  = 3,016-188  work  groups  without  "correct"  classification). 

The  discriminant  functions  were  also  applied  to  the  developmental 
sample  (i.e.,  the  data  set  from  which  they  were  generated.)  The  results 
are  given  in  Table  19.  The  remarkab'le  consistency  of  the  accuracy  of 
prediction  (77. 0%  ft  77.4%)  indicate  substantial  stability  across  data 
sets.  This  is  significant  and  indicates  portability  of  such  equations 
as  well  as  decreasing  the  Importance  of  a large  data  set  for  developmental 
purposes . 


THE  COMPARISON  OF  CLASSIFICATION 
BY  THE 

DISTANCE  FUNCTION  (ROWS)  AND  DISCRIMINANT  FUNCTION  (COLUMNS) 


NUMBER  CORRECT  2177.  PERCENT  CORRECT  77.0 


DEVELOPMENTAL  SAMPLE 
THE  COMPARISON  OF  CLASSIFICATION 
BY  THE 

MST ANLt  FUNCTION  (ROWS)  AND  DISCRIMINANT  FUNCTION  (COLUMNS) 


78 


NUMBER  CORRECT  2185.  PERCENT  CORRECT  77.4 
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The  combination  of  the  developmental  and  test  samples  are  reported 
in  Table  20.  For  Table  21,  the  total  data  set,  including  those  work  groups 
of  size  one  or  greater  than  40  which  were  previously  omitted,  was  submitted 
for  discriminant  function  classification.  The  decline  in  accuracy  (albeit, 
only  from  77.2%  to  76.6%)  indicates  that  the  odd  sized  work  groups  (mostly 
N=1 ) are  not  classified  as  accurately  as  those  with  size  between  two  and  40. 

The  last  analysis  of  these  figures  was  to  examine  the  accuracy  of 
correct  classification  by  "correct"  type.  The  results  of  these  calculations 
are  given  in  Table  22.  The  two  types  with  the  lowest  accuracy  figures 
are  types  three  and  five.  In  both  types,  the  adjacent  straight  line 
profiles,  combined,  would  appear  to  be  nearly  as  attractive  alternatives 
as  the  non-straight  line  profiles  (types  nine  to  17).  • 

Bayesian  Method 

The  Bayesian  approach,  as  discussed  earlier,  invol ved  several  steps 
(see  page  46).  Important  to  the  method  is  the  concept  of  independence. 

This  independence  is  not  with  respect  to  work  groups,  but  with  respect  to 
the  S00  indexes  which  are  the  "predictor  variables."  This  section 
discusses  the  results  in  applying  the  Bayesian  method  to  tne  organizational 
diagnosis  problem. 

As  discussed  by  Bowers,  et  al . (1977),  there  are  two  traditional 
approaches  to  handling  the  independence  issue:  (1)  ignoring  the  lack  of 
independence,  and  (2)  clustering  the  predictor  variables  into  relatively 
independent  subsets.  The  first  approach  raised  significant  computational 
problems.  To  describe  them,  it  is  nelpful  to  go  through  the  implementation 
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The  computer  program  that  operationalizes  the  steps  given  earlier 
(page  16)  required  the  estimates  of  the  two  parameters,  a and  p,  for  the 
beta  distributions  which  were  used  to  model  the  actual  (but  unknown) 
distributions  of  each  index  within  each  type.  The  a values  ranged  from 
just  over  one  to  nearly  28,  while  8 values  were  from  near  zero  to  20.5. 

The  resulting  constant  terms  in  the  densities  (the  ratio  of  gamma  terms  in 
equation  (1)  on  page  45)  ranged  from  two  to  two  x 1031.  The  result  of  having 
numbers  of  this  magnitude,  when  computing  the  odds-likelihood  ratio  (given 
by  equation  4,  step  4,  page  46)  was  obtaining  values  which  exceeded  the 
normal  storage  capabilities  of  computers  (1072  or  10"72).  Thus,  the 
algorithm  was  not  possible  to  implement  in  its  present  form.  This  is 
primarily  due  to  the  lack  of  independence  between  indexes,  and  partially 
due  to  the  number  of  predictor  variables. 

In  the  investigation  of  possible  clusters  of  indexes  which  have  a 
reasonable  conceptual  basis,  the  pairwise  correlations  remained  above  0.5, 

The  multiple  relationships  would  be  even  larger.  The  inability  to  meet  the 
independence  criterion  of  this  technique  remains  a major  obstacle.  It  also 
seems  highly  unlikely  that  any  set  of  predictors  (not  just  S00  measures) 
chosen  to  reflect  organizational  functioning  will  meet  the  independence 
criterion. 

While  multivariate  models  not  requiring  independence  are  available  for 
use  as  prior  distributions,  the  sample  N necessary  to  estimate  all  the 
parameters  would  exceed  greatly  the  relatively  large  N=7,000  that  is  presently 
available. 

It  seems  unlikely,  therefore,  that  implementation  of  Bayesian  classifi- 
cation procedures,  within  the  constraints  of  present  application  techniques. 
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will  prove  viable  to  data-based  organizational  diagnosis.  It  would  appear 
that  it  rests  with  the  mathematical  statisticians  to  develop  (and  have 
distributed)  techniques  for  using  Bayesian  approaches  to  highly  inter- 
dependent, multi-faceted  data. 
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TABLE  22 

t DISCRIMINANT  FUNCTION  ACCURACY  BY  TYPE 


Type 

Percent  Accurately  Classified 

Total 

Sample 

Test 

Sample 

Developmental 

Sample 

1 

90* 

81 

89 

2 

85 

90 

92 

3 

63 

66 

64 

4 

77 

79 

76 

5 

69 

69 

70 

6 

82 

81 

82 

7 

76 

77 

77 

8 

76  . 

78 

78 

9 

75 

75 

74 

10 

81 

82 

85 

11 

77 

73 

82 

12 

88 

93 

88 

13 

77 

78 

75 

14 

76 

75 

79 

15 

86 

83 

86 

16 

81 

82 

77 

17 

76 

81 

77 

*The  total  sample  values  are  not  the  average  of  the  two 
sample  values  as  the  former  only  includes  work  groups 
or  size  one  and  those  larger  than  N=40. 


ADDITIONAL  CRITERIA 


The  results  presented  thus  far  have  reflected  one  primary  concern 
of  classification  procedures,  that  being,  proportion  of  correct  classifi- 
cation. There  are  other  accuracy  oriented  criteria  as  well  as  non-accuracy 
based  criteria  which  are  also  of  interest. 

One  of  the  accuracy  based  criteria  is  the  degree  to  which  the  clusters 
of  work  groups  (or  other  units  of  analysis)  generated  by  the  classification 
techniques  replicate  the  typology.  Another  criteria  re-defines  the  concept 
of  closeness  of  a work  group  to  the  definitional  values  of  the  typology. 
Still  others  have  examined  the  cost  (loss)  of  misclassi fying  a particular 
work  group  into  the  "wrong"  type.  These  accuracy  oriented  criteria  are 
used  to  assess  the  organizational  diagnosis  procedures  under  consideration. 

In  an  earlier  report  in  this  series  (Bowers,  et  al,  1977),  several 
concepts  relating  to  criteria  other  than  accuracy  based  criteria  were 
discussed  as  possible  alternative  approaches  to  evaluating  classification 
procedures.  Three  such  non-accuracy  based  criteria  are  considered  here. 

One  is  whether  the  amount  of  information  used  in  making  a classification 
can  be  reduced.  Another  is  whether  there  are  considerable  differences 
in  the  amount  of  data  necessary  to  develop  the  procedures.  Finally,  the 
ease  in  calculating  these  data  based  classifications,  whether  they 
can  be  done  "on-site,"  is  a major  feature. 

Reproduce  the  Typology 

If  one  thinks  of  a typology  which  involves  k variables  and  n classes 
as  n points  in  k-dimensional  space,  one  approach  to  evaluating  a classifi- 
cation procedure  is  to  see  how  well  the  clusters  of  classified  units  gather 
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about  the  n points.  To  the  extent  that  the  clusters  don't  group  abou 
the  definitional  types,  one  can  conclude  that  the  lack  of  fit  is  due  to 
either  the  classification  procedure  or  the  inappropriateness  of  the  typology 
itself  (or  a mixture  of  the  two).  The  discussion  here  is  primarily  focused 
at  the  classification  procedures,  but  also  implications  regarding  the 
typology  are  presented. 

Two  major  features  of  the  fit  of  clusters  to  the  typology  are  the 
proximity  of  the  vector  of  means  of  a cluster  to  the  corresponding  defini- 
tional values  of  that  type,  across  all  types,  and  the  degree  of  dispersion 
within  each  cluster.  To  measure  the  proximity,  the  vector  of  index  means 
was  computed  for  each  of  the  17  clusters  which  resulted  from  each  of  the 
three  classification  procedures  (distance  function,  discriminant  function, 
and  decision  tree).  Then  the  distance  between  cluster  mean  vectors  and 
the  definitional  values*  of  tne  respective  types  was  calculated.  The 
results  are  given  in  Table  23. 

Two  aspects  of  this  analysis  are  relevant.  First,  looking  only  at 
the  relative  magnitude  of  the  distances,  first  across  classification 
procedures  and  then  down  the  types,  one  sees  that  the  distance  procedure 
has  the  best  pattern  of  smallest  distances  by  index.  The  decision  tree 
procedure  has  the  second  best  pattern.  The  former  is  as  expected.  The 
latter,  while  possibly  surprising,  is  due  to  the  nature  of  the  algorithm 
which  assigns  work  groups  to  types  by  fitting  them  to  the  definitional 
values. 


*The  values  which  were  used  as  the  definitional  values  of  the  typology  were 
results  of  the  development  of  the  typology  (Bowers  & Hausser,  1975).  In 
particular,  they  were  the  simple  means  of  the  cluster  means  from  each  of 
the  three  samples  used  to  develop  the  typology. 
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TABLE  23 

DISTANCE  BETWEEN  TYPE  CLUSTERS  AND  VALUES 
GIVEN  BY  THE  TYPOLOGY 


Type 

Distance 

Function 

Classified  by 

Discriminant 

Function 

Decision 

Tree 

1 

.3573 

.4655 

.6581 

2 

.4886 

.6836 

.6418 

3 

.2270 

.2518 

.2112 

4 

.2128 

.3100 

.2547 

5 

.1746 

.2567 

.1775 

6 

.4118 

.5794 

.4308 

7 

.2258 

,2517 

.2537 

8 

.3367 

.4347 

.4567 

9 

.3859 

.7001 

.5384 

10 

.3650 

.4818 

.5234 

11 

.4114 

.5769 

.4463 

12 

.3430 

.4768 

.4693 

13 

.4451 

.5547 

.4935 

14 

.2562 

.3312 

.2781 

15 

.3057 

.3782 

.5337 

16 

.5715 

.8907 

.7217 
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The  interpretation  of  these  distances  is  confounded  by  the  fact  tha+ 
the  different  procedures  assigned  different  work  groups,  and  thus,  different 
numbers  of  work  groups,  to  the  individual  types  (see  Table  24).  For  example, 
while  the  decision  tree  procedure  has  the  smallest  mean  distance  to  the 
Type  3 definitional  values,  it  also  classified  244  fewer  groups  as  Type  3 
than  did  the  distance  function  (the  244  groups  were  thus  said  to  be  mis- 
classified).  Hence,  absolute  comparisons  in  Table  23  must  be  made  with 
caution. 

A second  feature  of  the  values  given  in  Table  23  is  the  interpretability 
of  the  numbers.  For  example,  how  does  one  interpret  a distance  of  .3573 
in  14  dimensional  space?  As  a partial  answer  to  this  dilemma,  the  distance 
between  two  pairs  of  types  (using  the  definitional  values)  were  computed. 

They  were: 

distance  (Type  one  to  Type  six)  = 1.5334, 
distance  (Type  five  to  Type  six)  = 1.0218. 

Thus,  the  distance  of  .3573  between  the  distance  function  cluster  one  and 
the  defining  values  is,  at  most,  23  percent  of  the  distance  toward  Type 
2 from  Type  1.  (Given  the  geometry  of  14  dimensional  space,  direc- 
tionality is  not  linear  and  thus  the  direction  of  the  distance  is  difficult 
to  interpret.) 

Again  referring  to  Table  23,  one  notices  considerable  variation  across 
types  with  respect  to  the  distance  from  the  definitional  values.  This 
variation,  however,  is  not  particularly  greater  among  the  divergent  types 
(Types  nine  to  17)  than  among  the  straight  line  types  (Types  one  to  eight). 
Given  that  the  dispersion,  as  will  be  shown  shortly,  is  not  particulary 
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TABLE  24 

NUMBER  OF  WORK  GROUPS  PER  TYPE* 


Classified  by 

Distance  Discriminant  Decision 

Type  Function  Function  Tree 


1 

169 

166 

217 

2 

267 

353 

312 

3 

749 

557 

505 

4 

539 

595 

415 

5 

489 

464 

304 

6 

364 

411 

384 

7 

578 

502 

403 

8 

350 

332 

329 

9 

196 

301 

305 

10 

237 

271 

357 

11 

433 

383 

532 

12 

207 

264 

265 

13 

363 

306 

419 

14 

268 

265 

319 

15 

171 

157 

224 

16 

118 

152 

165 

17 

153 

172 

196 

N=5,651 

*For  only  work  groups  with  N,  2<N<40 
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greater  for  those  types  with  larger  distance  values,  changes  in  the 
definitional  values  of  the  typology  for  those  types  are  suggested. 

The  second  feature  of  fit  is  the  degree  of  dispersion  within  the 
clusters.  An  (unreal)  ideal  model  would  have  all  cluster  members  falling 
exactly  at  the  same  point  in  space.  The  actual  dispersion  was  computed 
for  each  index  within  each  type,  and  averaged  across  indexes.  These 
average  values  are  shown  in  Table  25.  Also  given  is  the  weighted  average 
of  standard  deviations  from  the  three  samples  that  were  used  to  develop 
the  typology.  The  general  pattern  can  be  seen  to  be  that  all  three  classi- 
fication schemes  assign  groups  to  clusters  whose  standard  deviations  are 
smaller  than  for  the  developmental  sample  of  the  typology  (three  subsamples). 

Comparison  of  the  values  in  Table  25  is  simplified  if  one  assigns 
ranks  to  the  four  values  given  for  each  type.  These  values  are  given  in 
Table  26.  The  pattern  of  ranks  for  discriminant  function  shows  that  it 
is  the  least  variable  within  clusters  (recall,  however,  it  was  the  furthest 
distance  from  the  definitional  values).  However,  returning  to  Table  25, 
one  notices  that  most  often  the  difference  in  average  variability  between 
clusters  given  by  the  distance  function  and  the  discriminant  function  is 
less  than  .01.  Effectively,  it  can  be  said  that  the  variance  within 
clusters  is  approximately  the  same  for  all  three  techniques,  all  of  which 
are  better  than  for  the  developmental  sample. 

In  summary,  the  distance  function  seems  to  reproduce  the  typology 
most  effectively  (as  expected).  While  there  is  some  difference  between 
the  distance  of  the  vectors  of  means  of  the  clusters  and  the  definitional 
values  for  the  three  classification  procedures,  the  variability 
within  clusters  is  approximately  the  same  across  procedures. 
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TABLE  26 

RANKS  OF  DISPERSION  WITHIN  TYPES 


Type 

Typology 

Developmental 

Sample 

Distance 

Function 

Discriminant 

Function 

Decision 

Tree 

1 

3 

1 

2 

4 

2 

4 

3 

2 

1 

3 

4 

3 

1 

2 

4 

4 

2 

3 

1 

5 

4 

3 

2 

1 

6 

1 

2 

4 

3 

7 

4 

3 

2 

1 

8 

4 

3 

2 

1 

9 

4 

1 

2 

3 

10 

4 

3 

1 

2 

11 

2 

1 

3 

4 

12 

4 

2 

3 

1 

13 

4 

3 

1 

2 

14 

1 

3 

2 

4 

15 

4 

2 

1 

3 

16 

4 

1 

2 

3 

17 

2 

3 

1 

4 
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All  three  procedures  seem  to  reproduce  the  typology  with  some  effectiveness. 
One  implication  is  that  the  typology  is  not  just  developmental  sample 
based,  and  does  have  usefulness  in  a broader  arena. 


Zero-One  Measure 

As  shown  in  the  Introduction  of  this  report,  two  differt  work  groups 
may  be  equally  distant  from  a particular  type,  but  have  quite  different 
closeness  properties.  A second  way  to  measure  closeness  (other  than  the 
distance  function)  is  to  count  the  number  of  indexes  of  a particular  work 
group  which  are  "far  away"  from  the  type  values.  The  arithmetic  of  this 
counting  function  for  each  work  group  can  be  represented  by: 


14 


i=l 


where  C.  is  given  by 


1,  i f | Y.-X. |^s , 
0,  if | Y1-Xi |<s. 


for  i =1 ,2 , . . . ,14.  Here  s is  some  defining  value  representing  "significant 
difference,"  and  _Y  and  X^  are  the  vectors  corresponding  to  the  definitional 
values  of  a type  and  a work  group's  actual  scores,  respectively. 

In  order  to  compute  C for  each  work  group,  average  them  within  types, 
and  then  average  them  across  types,  s was  given  to  be  some  weight  times  the 
standard  deviation  of  the  ith  index  (averaged  across  types).  The  results 
for  different  weights  ranging  from  .5  to  1.25  are  given  in  Table  27.  As 
would  be  expected,  the  average  number  of  differences  decline  as  the  signi- 
ficant difference  value  increases.  Also,  there  appears  to  be  no  significant 
difference  among  the  classification  procedures  with  respect  to  this  criterion 
(for  the  weight  values  given). 


AVERAGE  NUMBER  OF  INDEXES  PER  WORK  GROUP 
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There  is  an  interesting  relationship  between  this  zero-one  counting 
criterion  and  the  decision  tree  algorithm.  This  zero-one  method  counts 
the  number  of  times  an  index  is  considerably  different  from  the  defining 
value.  The  decision  tree  algorithm,  as  used  herein,  effectively  counts 
the  number  of  times  an  index  is  close  to  the  defining  value.  Thus,  if  the 
weight  had  been  1.5,  then  the  decision  tree  algorithm  would  have  been 
effectively  the  same  as  classifying  by  minimizing  the  zero-one  count. 

The  last  feature  to  be  gleaned  from  Table  27  is  that  there  seems  to 
be  considerable  deviation  between  most  work  group  index  values  and  the 
corresponding  index  values  of  the  definitional  types  juding  from  the  mag- 
nitude of  the  average  values  given.  However,  the  size  of  most  index 
standard  deviations  is  slightly  less  than  .4,  so,  for  example,  when  the 
weight  is  one,  4 1/2  indexes  (on  the  average)  fall  more  than  .4  from  the 
corresponding  definitional  value. 

While  not  shown  here,  there  was  little  or  no  difference  between  types 
with  respect  to  the  average  number  of  differing  work  group  values. 

Severity  of  Misclassification 

As  indicated  earlier,  errors  of  classification  may  vary  quite  widely 
in  their  impact  upon  the  group  so  classified.  The  impact  stems,  of  course 
from  the  consequences  of  applying  to  the  classified  group  a treatment 
appropriate  to  the  type  of  misclassification.  If,  for  example,  the  group 
should  have  received  survey  feedback  but  instead  receives  data  handback, 
the  consequences  might  be  on  absence  of  the  positive  change  that  would  nor 
mally  have  occurred,  or  even  outright  deterioration.  Since  appropriate 
treatment  is  determined  by  the  most  favorable  change  con sequences  (as  far 
as  our  data  carry  us).  In  no  case  would  the  result  be  more  favorable  than 
that  indicated  for  the  appropriate  treatment. 
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By  Multiple  Discriminant  Function 

To  examine  this,  we  returned  to  the  table  of  change  consequences 
contained  in  the  original  typology  analysis  (Bowers  & Hausser,  1977). 

For  classification  by  the  discriminant  function,  accuracy  (compared  to 
our  criterion,  the  distance  function  used  in  the  original  typology)  is 
displayed  in  Table  28.  Accurate  classification  Is  contained  in  those 
frequencies  falling  in  the  upper  left  to  lower  right  diagonal.  All  off- 
diagonal  cases  are  therefore  instances  of  misclasslfication.  Of  the  total 
of  6,944  cases,  5,319  (77  percent)  are  classified  correctly.  One  thousand 
six  hundred  twenty  five  (or  23  percent)  are  misclassifled. 

Of  the  5,319  classified  correctly,  5,149  are  in  other  than  Type  16, 

,for  which  type  no  treatment  evidence  was  available  in  the  original  analysis. 

To  these  5,149,  we  applied  a simple  coding  scheme: 

4 = ++  change  in  the  original  findings 

3 = + change  in  the  original  findings 

2=0  change  in  the  original  findings 
1 = - change  in  the  original  findings 

0 = — change  In  the  original  findings 

The  mean  change  for  the  5,149  correctly  classified  cases  would  there- 
fore have  been  3.10  (or  "+.") 

Of  the  1,625  misclassifled  cases,  1,157  were  In  types  (correct  or 
misclassified)  for  which  both  correct  classification  change  codes  and  mis- 
classlflcatlon  change  codes  existed.  To  assess  the  Impact  of  misclasslfi- 
cation,  the  following  coding  scheme  was  used: 

4 = Assigned  to  treatment  where  result  was  (— ) instead 
of  correct  one  which  would  have  been  (++.) 

3 = Assigned  to  treatment  where  result  was  (--)  Instead 
of  correct  one  which  would  have  been  (+.) 

or 

Assigned  to  treatment  where  result  was  (-)  Instead 
of  correct  one  which  would  have  been  (++.) 


«W 


r ~r 


NUMH  R CQPRFCT  "•  t:'ERCEfJ  » • iR«4  i 76.6 
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2 = Assigned  to  treatment  where  result  was 
of  correct  one  which  would  have  been 


(0)  instead 
(++.) 


or 


Assigned  to  treatment  where  result  was 
of  correct  one  which  would  have  been 


(-)  instead 
( + .) 


1 = Assigned  to  treatment  where  result  was 
of  correct  one  which  would  have  been 


(+)  instead 
(++.) 


or 


Assigned  to  treatment  where  result  was 
of  correct  one  which  would  have  been 


(0)  instead 
( + .) 


or 


Assigned  to  treatment  where  result  was 
of  correct  one  which  would  have  been 


(-)  instead 
(0.) 


or 

Assigned  to  treatment  where  result  was  (--)  instead 
of  correct  one  which  would  have  been  (-.) 

0 = Assigned  to  treatment  where  result  is  the  same. 


The  result  is  therefore  a change  degrading  scale  from  4 (maximum 
amount  of  degrading)  to  0 (minimum  amount  of  degrading.)  By  assuming  that 
cases  were  equally  distributed  among  tied  "best  treatments,"  we  calculated 
the  degrading  effects  of  misclasslficatlon  for  the  discriminant  function 
approach.  Of  the  1,157  cases,  857.83  were  cases  in  which  misclassiflcation 
results  in  no  degrading  of  change.  The  overall  effect  is  shown  by  type  and 
overall  in  Table  29.  This  indicates  that  the  overall  mean  degrading  would 
be  just  under  one  half  of  a position,  with  a range  from  two  full  points  to 
zero. 

However,  this  mean  could  occur  from  a number  of  different  combinations, 
some  of  them  far  more  serious  from  a severity  of  mlsclassl fication  point 
of  view.  For  the  299.17  non-zero  misclasslfleds,  the  distribution  is  as 
Indicated  in  Table  30.  The  overwhelming  majority  of  degraded  cases  represent. 
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TABLE  29 

MEAN  CHANGE  DEGRADING  FOR  MISCLASSI FIEDS  BY  TYPE 
(Discriminant  Function  Method) 


Type 

Mean  Degrading 

N 

1 

2.00 

2 

2 

.88 

56 

3 

.19 

309 

4 

.59 

137 

5 

o 

o 

48 

6 

.03 

77 

7 

1.25 

154 

8 

.00 

18 

9 

.13 

68 

10 

N/A 

N/A 

11 

.19 

105 

12 

1 .36 

28 

13 

o 

o 

86 

14 

.02 

58 

15 

1.00 

11 

16 

N/A 

N/A 

17 

N/A 

N/A 

Total 


40 


1157 
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TABLE  30 

INCIDENCE  OF  DEGRADING  BY  SEV! 
(Discriminant  Function  Methc 


Form  of  Misclassi fication 
Misclassified  as  Should  Have  Been 

++ 

+ 

++ 

+ 

++ 

+ 

0 

++ 


0 

0 

+ 


Percent 
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a tendency  toward  "no  change"  misclassification,  where  positive  change 
should  have  occurred.  In  summary  form,  89  percent  were  instances  of  no 
positive  change,  whereas  only  11  percent  were  instances  of  destructive 
change.  Stated  more  succinctly,  had  a discriminant  function  approach 
been  used,  only  one  case  in  nine  would  have  resulted  in  destructive 
consequences . 

By  Decision  Tree  Method 

For  the  decision  tree  method,  accuracy  of  classification  is  displayed 
in  Table  31.  Once  more  for  the  total  of  6,944  cases,  4,972  (72  percent) 
were  classified  correctly.  One  thousand  nine  hundred  seventy  two  cases 
(28  percent)  were  misclassified. 

Of  the  4,972,  4,804  cases  were  in  other  than  Type  16,  for  which 
no  treatment  evidence  was  available  in  the  original  analysis.  The  same 
change-coding  system  (4  to  0)  used  in  analyzing  the  discriminant  function 
approach  was  applied  to  these  cases.  The  mean  change  for  these  4,804 
correctly  classified  cases  was,  in  this  instance,  3.03  (once  more  "+.") 

Of  the  1,972  misclassified  cases,  1,458  were  in  types  (correct  or 
misclassified)  for  which  both  correct  classification  change  codes  and 
misclassification  change  codes  exist.  Of  the  1,458,  905.17  were  cases  in 
which  misclassification  results  in  no  degrading  of  change.  The  overall 
effect  is  shown  by  type  and  overall  in  Table  32.  This  shows  that  the 
overall  mean  degrading  would  be  just  over  one  half  of  a position,  with  a 

range  from  slightly  more  than  one  and  one  half  points  to  zero. 

1 

Again,  this  mean  would  occur  from  many  particular  combinations  of 
differing  severity.  For  the  552.83  nor.-zero  misclassifieds,  the  distribution 
is  presented  in  Table  33.  Once  more  the  majority  of  degraded  cases 
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THE  COMPARISON  OF  CLASSIFICATION 
BY  THE 

DISTANCE  FUNCTION  (ROWS)  AND  DECISION  TREE  (COLUMNS) 


NUMBER  CORRECT  4972  PERCENT  CORRECT  71 
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TABLE  32 

i . 

. 

MEAN  CHANGE 

OEGRAOING  FOR  MI SCLASS I FI  EOS , 

BY  TYRE 

! 

(Decision  Tree  Method) 

Type 

Mean  Degrading 

N 

1 

1.00 

1 

2 

.77 

109 

3 

.69 

356 

4 

.41 

244 

5 

o 

o 

• 

64 

6 

.39 

114 

7 

.81 

236 

8 

.00 

40 

9 

1 .57 

60 

10 

.00 

4 

11 

.68 

71 

12 

.69 

46 

13 

o 

o 

• 

39 

14 

.10 

59 

15 

.73 

15 

16 

N/A 

N/A 

17 

N/A 

N/A 

u 

Total 

.59 

1458 

1 04 


TABLE  33 

INCIDENCE  OF  DEGRADING  BY  SEVERITY 
(Decision  Tree  Method) 


Form  of  Misclassification 

Misclassified  as  Should  Have  Been  N Percent 

++  0 0 
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represent  a tendency  toward  "no  change,"  rather  than  toward  outright 
deterioration. 

However,  the  consequences  are  in  this  instance  somewhat  more  severe. 
Seventy-five  percent  were  instances  of  no  positive  change,  while  2 5 percent 
were  instances  of  destructive  change.  Stated  otherwise,  by  a decision 
tree  method,  one  case  in  four  misclassifieds  with  degrading  would  have 
resulted  in  destructive  change. 

A Comparison  to  Change  Agent  Choice 

An  interesting  question  is  the  way  in  which  either  of  these  methods 
compare  with  the  most  common  form  of  change  treatment  selection:  change 
agent  judgment  or  choice.  To  assess  this,  we  turned  to  the  data  used  in 
the  original  study  whicn  generated  the  hypothesis  (Bowers  & Hausser,  1977). 
Table  34  shows  appropriate  treatments,  as  determined  by  the  data  in  that 
original  analysis,  proportions  by  type  correctly  classified  and  misclassi fied, 
plus,  where  feasible,  a test  of  the  difference  between  proportions. 

Several  observations  seem  worth  making  from  these  data.  First,  if 
assigned  treatment  is  taken  as  indicative  of  classification,  it  is  apparent 
that  change  agents  were  exceedingly  poor  judges,  since  they  misclassified 
significantly  more  frequently  than  they  classified  correctly.  Second, 
the  differences  by  type  are  rather  marked.  Even  ignoring  types  with  small 
frequencies,  the  correct  classification  percentages  range  from  95  (for 
Type  14)  down  to  three  (for  Type  3).  Third,  the  misclassification  discre- 
pancies are  most  marked  in  the  case  of  those  types  whose  appropriate 
treatment  was  survey  feedback  alone. 

This  Is  demonstrated  more  pointedly  in  Table  35,  where  cases  from 
those  types  calling  for  survey  feedback  alone  and  non-survey  feedback 
alone  are  compared.  Only  one-fourth  of  the  cases  which  should  have  called 


106 


f 


t 

► 


k 


i 


TABLE  34 


EFFECTS  i 

OF  CHANGE  CLASSIFICATION 

Type 

N 

Correct 

Treatment* 

Percent 

Correctly 

Classified 

Percent 

Misclassi fied  i 

P 

1 

10 

DH 

70 

30 

-3.35 

N/A 

2 

58 

SF 

28 

72 

-3.35 

.001 

3 

98 

SF 

3 

97 

-9.31 

.0001 

4 

53 

SF 

28 

72 

-3.20 

.001 

5 

9 

IPC 

100 

0 

N/A 

6 

59 

SF/IPC 

34 

66 

-2.46 

.01 

7 

41 

SF 

37 

63 

-1 .66 

NS 

8 

14 

SF/IPC 

100 

0 

N/A 

9 

26 

SF 

38 

62 

-1.22 

NS 

10 

4 

DH 

100 

0 

N/A 

11 

27 

SF 

52 

48 

+0.02 

NS 

12 

46 

SF/TPC 

41 

59 

-1.22 

NS 

13 

11 

SF/LT 

100 

0 

N/A 

14 

56 

IPC/TPC/LT 

95 

5 

+6.73 

.0001 

15 

17 

IPC 

35 

65 

-1.24 

NS 

16 

-- 

(NA) 

— 

— 

— 

17 

4 

LT 

100 

0 

-- 

Total  533 

40 

60 

-4.03 

.0001 

*DH  = Data  Handback 
SF  = Survey  Feedback 

IPC  = Interpersonal  Process  Consultation 
TPC  = Task  Process  Consultation 
LT  = Laboratory  Training 

■ 
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TABLE  35 

PERCENTS  RECEIVING  APPROPRIATE  AND  INAPPROPRIATE  TREATMENT 
FOR  SURVEY  FEEDBACK  ONLY  AND  OTHER  ONLY 


Appropriate  Treatment 

Recei ved 
Appropriate 
Treatment 

N % 

Received 

Inappropriate 

Treatment 

N % 

Survey  Feedback  only 

73 

24 

230 

76 

Other  than  Survey  Feedback  Only 

81 

81 

19 

19 
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only  for  survey  feedback  were  assigned  to  that  treatment,  whereas  mnro  t >■ 
three-fourths  of  the  cases  calling  for  some  other  treatment  did,  indeed, 
receive  that  other  treatment. 

Other  data,  not  elaborated  here,  provide  a bit  more  insight  into  this 
shift.  Among  the  76  percent  of  survey  feedback  only  appropriate  groups, 
which  received  some  other  treatment,  only  15  percent  received  data  handback 


Sixty-one  percent  therefore  were  assigned  to  some  form  of  more  process- 
oriented  treatment  (IPC,  TPC,  or  LT),  a shift  which  has  been  noted  more 
broadly  in  anecdotal  terms  elsewhere  (Bowers,  1976). 

With  this  in  mind,  it  seems  useful  to  speculate  on  the  dynamics 
of  what  might  have  occurred.  First,  it  is  important  to  note  that,  at  the  time 
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received  interpersonal  process  consultation.  As  an  instance  of  the  first 
type  of  error,  it  may  have  happened  that  the  change  agents  reduced  dissonance 
from  what  were  their  preferred  styles  by  changing  the  level  to  either 
an  1-40  or  an  1-25,  both  of  which  call  for  interpersonal  process  consultation. 
Alternatively,  they  may  have  subjectively  chosen  to  focus  on  processes  inside 
the  groups  and  changed  the  level  of  organizational  climate  upward  (to  a 
Type  14),  which  calls  for  either  of  the  process  consultation  treatments 

5 

or  laboratory  training. 

Whatever  the  dynamic,  the  data  seem  to  demonstrate  that  the  classifi- 
cation process  implicit  in  treatment  selection  was  far  worse  than  would 
have  been  the  case  had  either  a multiple  discriminant  function  or  decision 
tree  approach  been  used.  It  was  on  the  whole  even  worse  than  chance. 

The  unanswerable  question,  of  course,  is  how  well  the  change  agents  would 
have  done,  had  they  been  provided  with  percentile  scores,  type  designations, 
and  treatment  information  geared  to  the  typology. 

Non-Accuracy  Based  Criteria 

Information  Required  to  Make  a Decision 

If  a particular  classification  procedure  requires  considerably  less 
information  than  another,  yet  performs  (approximately)  as  well,  it  is 
more  advantageous  in  that  collecting  and  processing  information  is  costly 
in  terms  of  time  and  money.  Different  conceptual  approaches  to  the 
reduction  of  data  are  discussed  by  Bowers,  et  al . (1977).  The  implementation 
of  these  procedures,  where  possible,  are  discussed  herein. 

One  approach  to  reduce  the  amount  of  data  used  would  be  the  deletion 
of  some  of  the  presently  used  14  S00  indexes.  One  proposed  technique 
Involved  an  analysis  of  the  discrimination  capabilities  of  different  indexes 
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within  the  Bayesian  framework,  i.e.,  those  indexes  with  likelihood  funr- 
tions  that  do  not  vary  greatly  among  profile  types.  The  inability  to 
I,  implement  the  Bayesian  approach,  as  discussed  earlier,  precluded  such 

investigation.  Also,  the  implementation  of  the  decision  tree  algorithm 
took  on  a different  form  than  originally  proposed.  The  present  algorithm 
treats  all  indexes  equally.  The  prior  concept  was  to  sequence  the  indexes 
in  order  of  importance.  This  would  have  permitted  the  comparison  of  classi- 
fication of  various  (ordered)  subsets  of  indexes  with  respect  to  accuracy. 

The  design  of  the  decision  tree  algorithm  as  used  in  this  research,  for 
reasons  other  than  features  discussed  here,  took  an  alternative  form 
(unsequenced  indexes.)  Thus,  no  numerical  investigations  of  the  effect 
of  deleting  some  indexes  took  place.  However,  this  concern  is  still 
relevant,  and  ought  to  be  considered  in  future  classification  investigations. 

Another  approach  to  date  reduction  is  to  reduce  the  number  of  indexes 
by  combining  them  into  "super-indexes,"  (rather  than  deletion.)  This, 
in  effect,  is  what  was  done  to  implement  the  discriminant  function  classi- 
fication procedure.  The  inability  to  do  the  discriminant  functions  on  the 
original  set  of  14  indexes,  however,  precludes  examining  the  effect  of  this 
consol idation. 

A final  approach  to  data  reduction  is  the  elimination  of  some  individual 
data  from  the  data  set.  In  many  organizational  settings,  it  is  desired 
(usually  by  the  organization)  to  survey  only  a sample  of  its  employees. 

The  question  is  what  effect  will  this  sampling  have  on  the  diagnostic 

. 

process.  If  classification  of  work  groups  (or  whatever  else  the  primary 
unit  of  analysis  might  be)  can  be  done  with  only  a sample  of  employees, 
significant  savings  would  occur. 
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A question  that  ought  to  be  answered  prior  to  such  an  investigation 
is,  "What  would  be  a satisfactory  percentage  agreement  in  classification 
between  the  total  population  and  using  a sample?"  If,  in  fact,  one  could 
classify  work  groups  using  only  a sample  of  employees  with  98  percent 
agreement  with  what  one  would  obtain  using  all  employees,  the  sampling 
process  would  be  indicated.  Likewise,  if  the  agreement  is  only  10  percent, 
one  would  not  want  to  sample.  Where  is  the  dividing  line?  There  is  no 
factual  information  which  indicates  what  one  ought  to  choose  for  this 
acceptable  level  of  agreement.  The  opinions  of  the  present  investigators 
were  that  an  acceptable  level  of  agreement  would  be  80  percent  (with  a 
little  room  for  give,  say  down  to  75  percent.)  This  was  then  used  as  the 
criteria  of  acceptability  in  sampling,  i.e.,  that  80  percent  or  more  of 
all  work  groups  would  have  to  be  classified  the  same  when  using  the  sample 
data  set  as  when  using  the  entire  data  set. 

A second  feature  of  this  investigation  has  to  do  with  the  sampling 
procedure  used.  With  the  Survey  of  Organizations,  the  primary  unit  of 
analysis  is  the  work  group.  Thus,  what  is  being  classified  is  the  work 
group.  Yet,  we  are  not  sampling  work  groups,  but  individual  employees. 

One  natural  approach  would  be  to  do  a stratified  sampling  procedure,  taking 
the  same  percentage  of  employees  from  within  each  work  group.  However, 
as  the  support  for  sampling  is  usually  from  the  organization,  their  concerns 
ought  to  be  considered  here  as  well.  Our  experience  is  that  the  organiza- 
tion is  usually  interested  in  taking  a sample  of  all  employees  without 
regard  to  work  group  affiliation.  The  effect  of  this  is  to  have  varying 
proportions  of  employees  within  each  work  group,  with  no  assurance  of  a 
particular  work  group  having  any  representation  in  the  sample.  While  this 
may  represent  the  worst  possible  situation  with  respect  to  diagnosis  and 
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research,  it  has  viability  as  being  within  real  world  constraints.  For 
this  reason,  in  an  attempt  to  investigate  the  effect  of  sampling  employees 
on  classification,  unstratified  samples  were  selected. 

Two  different  sample  sizes  were  chosen  for  investigation;  67  percent 
and  33  percent.  The  procedure  was  to  randomly  select  67  percent  (337,) 
of  all  employees,  reform  work  group  scores  with  the  available  data,  and 
then  to  classify  the  work  groups  using  the  distance  function.  The  resulting 
work  groups  which  had  only  one  individual's  scores  were  omitted  (as  were 
those  very  large  work  groups.)  The  original  number  of  classifiable  work 
groups  (with  size  between  two  and  40)  was  5,651.  For  the  67  percent  sample, 
there  were  4,757  work  groups,  a reduction  of  15.fi  percent.  In  the  33  per- 
cent sample,  there  were  2,914  work  groups,  a reduction  of  48.4  percent. 

These  reductions  are  understandable  in  light  of  the  fact  that  of  the 
original  5,651  groups,  16.5  percent,  15.7  percent  and  12.9  percent  were  of 
size  two,  three,  and  four,  respectively. 

The  results  of  the  sampling  effects  on  classification  are  reported  in 
Tables  36  and  37.  The  percentage  of  classification  agreement  is  65.1 
percent  and  42.7  percent  for  the  67  percent  and  33  percent  samples, 
respectively.  These  figures  are  well  below  the  pre-established  acceptable 
rate  of  80  percent.  The  implication  of  this  is  that  sampling  does  negatively 
Impact  organizational  diagnosis  at  the  work  group  level. 

It  is  possible  that  had  the  sampling  been  done  in  a stratified  manner 
(say,  67  percent  of  all  work  groups),  the  percent  agreement  would  have  been 
higher.  It  Is  unlikely,  however,  that  It  would  be  15  percent  higher. 

This  does  Indicate  further  investigation.  Other  sampling  plans  might 
also  be  Investigated.  In  particular,  one  might  sample  only  from  those  work 
groups  of  size  four  or  larger. 


NUMBER  CORRECT  = 3098  PERCENT  CORRECT  = 65.125' 


CLASSIFICATION  AGREEMENT  USING  A 33%  SAMPLE 
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NUMBER  CORRECT  = 1243  PERCENT  CORRECT  = 42.656 
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Another  response  to  the  results  discussed  above  might  be  the  suggestion 
to  take  a larger  (than  67  percent)  proportion  of  employees,  say  80  percent. 
This  would  increase  the  agreement  in  classification.  However,  the  benefits 
of  using  a sample  rather  than  the  entire  population  are  greatly  diminished. 
With  that  large  of  a proportion  of  the  employees,  the  time/cost  savings 
are  minimized,  the  sampling  costs  are  increased,  and  the  questioning  of  the 
non-participation  of  only  a few  by  the  employees  themselves  is  increased 
significantly.  In  originally  choosing  the  67  percent  level,  the  authors 
consider  it  to  be  (at  or  near)  the  maximum  level  for  sampling,  given  both 
organizational  constraints  and  employee  responses. 

It  was  decided  not  to  explore  the  discriminant  function  and  decision 
tree  classification  algorithms  on  the  67  percent  and  33  percent  samples. 

As  seen  in  the  previous  section,  the  accuracy  of  classification  of  these 
techniques,  based  on  full  data,  is  below  80  percent.  The  ability  to 
reproduce  the  "correct"  classi fication  would  be  lower  when  using  a frac- 
tional data  set.  A note  of  caution:  a technique  may  be  able  to  reproduce 
itself  with  greater  accuracy  on  the  sample  data  than  the  distance  function 
procedure.  What  is  being  said  here  is  that  a technique  will  not  reproduce 
the  distance  classification  when  using  a partial  data  set  than  when  using 
the  total  data  set. 

In  summary,  using  samples  of  employees  from  an  organization,  rather 
than  surveying  all  employees,  appears  to  significantly  alter  the  classi- 
fication of  work  groups,  and  thus  cannot  be  recommended  for  use  in  an 
organizational  diagnosis  project. 
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Amount  of  Data  Required  to  Develop  the  Diagnostic  Process 

The  three  classification  techniques  require  differential  data  bases 
for  their  development.  The  purpose  of  this  section  is  to  discuss  available 
information  regarding  the  necessary  developmental  data  bases. 

As  earlier  reported,  the  distance  function  technique,  when  applied 
to  the  total  data  set,  generated  clusters  of  work  groups  which  represented 
the  typology  with  a fairly  high  degree  of  compatabil ity.  However,  there 
is  no  need  for  any  data,  other  than  what  may  be  used  to  generate  the 
typology,  before  implementing  the  distance  function  technique.  That  is, 
if  one  was  to  receive  a typology  from  an  outside  source,  no  developmental 
work  would  be  necessary. 

However,  in  most  instances,  the  typology  will  have  to  be  developed. 

In  the  present  case,  it  has  been  shown  that  a typology  based  on  a develop- 
mental sample  of  N=600  was  general izable  to  the  total  data  set  (N=6,944). 

The  decision  tree  algorithm  was  based  on  intervals  surrounding  the 
means  of  indexes  of  the  types.  If  one  had  intervals  provided,  no  develop- 
mental data  would  be  necessary.  However,  as  before,  this  is  unlikely. 

To  obtain  the  intervals,  one  could  easily  take  the  means  and  standard 
deviations  of  the  types  resulting  from  the  development  of  the  typology. 

The  stability  of  the  intervals  used  in  the  decision  tree  algorithm  would 
be  of  concern  (in  addition  to  the  validity  of  the  typology).  To  insure 
that  the  sample  mean  is  close  to  the  population  mean,  one  would  need 
samples  of  approximate  size  400,  1,600,  6,300  for  the  standard  error  of 
J to  be  .02,  .01,  ana  .05,  respectively,  for  each  of  the  various  indexes 
within  types. 
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The  discriminant  function  technique  requires  not  only  the  typology, 
but  an  external  "correct”  classification  on  a developmental  sample. 

The  size  of  the  developmental  sample  is  complicated  by  the  relative 
frequency  of  the  various  types.  The  smallest  relative  frequency  times 
the  sample  size  ought  to  be  at  least  50  or  more  (the  greater  the  number 
of  predictors,  the  larger  this  should  be). 

The  results  of  the  discriminant  function  analysis  given  earlier  do 
have  positive  implications.  The  ability  of  the  discriminant  function  to 
classify  the  test  sample  with  the  same  accuracy  as  the  developmental 
sample  indicates  the  stability  and  transferabil i ty  of  the  discriminant 
equations  from  a developmental  sample  to  new  data. 

The  conclusions  of  these  remarks  on  the  amount  of  information  required 
to  develop  the  classification  procedures  are  presented  in  Figure  4. 

Figure  4 
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Ease  of  Calculation 

The  ideal  diagnostic  technique  is  not  only  accurate,  but  relatively 
simple  to  perform.  In  an  organizational  diagnosis  situation,  the  diagnosti- 
cian may  often  be  a change  agent  who  would  like  to  make  treatment  recommen- 
dations while  on  site  and  without  having  to  resort  to  accessing  and/or 
experience  the  time  delay  of  electronic  data  processing  (EDP)  facilities. 

Three  ease-of-calculation  factors  are: 

(1)  Can  be  calculated  on-site  versus  in  a central  location. 

(2)  Can  be  done  by  hand  or  with  a hand  calculator  versus 
requiring  EDP  facilities. 

(3)  The  amount  of  personnel  time  involved  is  relatively  small. 

While  not  identical,  these  three  factors  are  clearly  interrelated. 

While  analyzing  the  different  classification  techniques  with  respect 
to  these  issues  is  done  subjectively,  the  difference  between  the  techniques 
is  relatively  clear. 

Both  the  distance  function  and  the  discriminant  function  techniques 
are  clearly  most  easily  done  by  computer.  The  distance  function  technique, 
however,  given  a table  of  index  values  for  the  types,  could  be  implemented 
with  a hand  calculator  (preferably  one  with  accessible  storage  and  minimal 
statistical  routines)  in  15  to  20  minutes  (per  unit  of  analysis,  e.g., 
work  group)  by  anyone  with  minimal  training  in  the  technique.  The  decision 
tree  algorithm  Is  clearly  the  most  straightforward,  least  cumbersome 
technique.  Anyone  could  use  the  technique  to  classify  a new  work  group 
in  five  minutes  or  less  --  without  the  aid  of  anything  but  a table  of 
Intervals  and  : pencil.  The  number  of  calculations  for  the  decision  tree 
algoiiLnm  is  the  smallest  (14  x 17  yes,  no  questions),  second  lowest  for 
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the  discriminant  function  (although  quite  complex),  and  highest  for  distance 

function  (although  relatively  simple  arithmetically). 

i 

In  summary,  the  decision  tree  methodology  is  the  most  portable, 
easily  applied,  and  requires  the  minimal  personnel  time.  The  distance 

> 

function  can  be  done  on-site,  with  a hand  calculator,  at  a moderate  level 
of  complexity.  The  discriminant  function,  for  reasonable  application, 
would  require  programmable  equipment  with  over  200  storage  units. 
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DISCUSSION  AND  CONCLUSIONS 

We  began  this  research  with  a view  that  organizational  diagnosis, 
as  practiced  in  most  development  efforts,  is  largely  unsystematic,  if  nof. 
capricious.  We  proposed  to  use  a work  group  diagnostic  typology  as  a 
basis  for  comparing  several  alternative,  data  based  methods  of  performing 
the  diagnostic  task.  As  such,  the  questions  which  we  wished  to  investigate 
revolved  around  the  durability  of  the  original  typology  and  the  effective- 
ness of  the  distance  function  approach  upon  which  it  was  based,  the  accuracy 
with  which  discriminant  function,  Bayesian,  and  decision  tree  methods 
would  classify  groups,  the  ability  to  employ  reduced  information  sets 
in  diagnosis,  and  the  comparative  ease  of  classification  of  the  various 
methods.  The  previous  sections  of  the  report  have  presented  the  results 
of  our  research;  in  the  present  section  it  remains  to  discuss  their  meaning 
and  significance. 

Durability  of  the  Typology  and  Ability  of  the  Techniques  to  Reproduce  it 

The  typology  does,  indeed,  appear  to  be  quite  durable.  It  holds  up 
quite  nicely  when  applied  to  the  entire  population  of  groups.  While  the 
Bayesian  method  met  a dead-end  early  on,  the  remaining  methods  --  distance 
function,  discriminant  function,  and  decision  tree  — all  produce  within- 
cluster  variances  which  are  not  only  not  greater  than,  but  are  actually 
smaller  than,  wlthin-cluster  variances  In  the  original  developmental  study's 
sample.  This  is  significant  because,  had  the  original  been  a statistical 
or  sampling  artifact,  we  would  have  expected  applying  the  distance  function 
(or  any  of  the  other  methods)  to  the  entire  population  to  produce  in  the 
fit-forcing  withln-cluster  variances  much  larger  values  than  those  obtained 
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in  the  original  study.  The  fact  that,  if  anything,  the  reverse  happens 
provides  rather  persuasive  evidence  that  the  typology  really  exists  and 
represents  more  than  an  artifact,  sampling  or  otherwise,  of  the  methods 
used  in  the  developmental  study. 

Beyond  this,  on  the  issue  of  size  of  within-cluster  variances,  no 
method  is  decidedly  superior  to  the  others.  All  produce  variance  values 
near  or  below  those  in  the  developmental  study.  However,  as  might  perhaps 
be  expected,  the  distance  function  produces  in  general  cluster  mean  values 
closer  to  our  criterion  of  the  "true"  values  (those  of  the  original  study) 
than  do  either  the  discriminant  function  or  decision  tree  methods. 

Although  we  have  rather  arbitrarily  chosen  the  cluster  means  of  the 
original  study  as  representing  the  time  values,  the  problem  is  no  doubt 
one  of  successive  iterations,  attempting  each  time  to  come  closer  to  the 
mark.  This  provides  us  with  a somewhat  broader  perspective  on  the  problem. 

That  the  typology  does  seem  quite  durable  suggests  rather  strongly 
that  there  are  true  types,  with  true  cluster  values  in  14-space,  which 
exist.  Viewed  in  somewhat  factor  analytic  terms,  there  are  true  vectors, 
to  which  we  are  attempting  in  our  research  to  rotate  clusters  classified 
by  various  methods.  The  variance  (of  groups  in  the  population  in  14-space) 
can  thus  be  broken  down  into  two  components: 

(a)  Variance  between  the  true  vectors  and  mean  cluster 
vectors. 

(b)  Variance  withln-clusters  of  classified  cases  from 
their  cluster  means. 

If  each  method  aligned  its  cluster  vectors  perfectly,  they  (the  methods) 
would  differ  In  accuracy  only  by  within-cluster  variance.  M we  have  seen, 
the  three  methods  do  not  differ  appreciably  on  this.  As  a general 
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statement,  therefore,  we  might  say  that  the  evidence  for  the  typology  i' 
quite  strong,  that  each  method  produces  vectors  which  are  equally  "tight," 
but  that  the  discriminant  function  and  decision  tree  methods  produce 
cluster  mean  values  which  are  somewhat  more  distant  from  the  original 
values  than  are  those  of  the  distance  function.  Since  the  latter  may  in 
some  measure  reflect  the  fact  that  the  same  method  was  used,  great  signi- 
ficance ought  perhaps  not  be  attached  to  it. 


Problems  in  Implementing  a Bayesian  Approach 

The  Bayesian  method  met,  within  the  limits  of  this  study,  a dead  end 
because  the  profile  indexes  are  not  independent.  Pursuing  it  despite  this 
fact  produced  values  for  the  odds-likelihood  ratio,  which  exceeded  the 
capacity  of  what  is  an  up-to-date  computer  (AMDAHL  470V6).  While  a Bayesian 
approach  cannot  be  permanently  dismissed,  its  feasibility  clearly  requires 
that  some  answer  be  found  to  the  problem  of  index  interdependence.  This 
interdependence  is  exacerbated  when  one  has  a fairly  large  set  of  predictors 
(say  more  than  7).  It  seems  highly  unlikely,  however,  that  measures  of 
organizational  functioning  will  ever  be  independent,  and  most  likely,  will 
continue  to  be  many  in  number. 


Proportion  of  Correct  Classification 


The  remaining  alternatives  to  the  distance  function  itself  --  namely, 
discriminant  function  and  decision  tree  --  seem  on  the  surface  to  produce 
approximately  equal  proportions  of  correct  classification  (77  versus  72 
percent.)  However,  this  is  true  only  If  ties  In  the  decision  tree  procedure 
that  Involve  the  correct  type  are  assigned  to  that  correct  type.  Omitting 


tied  cases,  classification  accuracy  for  the  decision  tree  drops  to  57  percent. 
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The  decision  tree  procedure,  as  implemented  herein,  produces  many  tied 
classifications.  Here,  as  in  its  misclassifications,  it  seems  unable  to 
reliably  distinguish  straight-line  profiles  from  divergent  profiles  at 
approximately  the  same  level  and  tends  to  consign  the  former  over  much 
to  the  latter. 

The  tied  classification  problem  for  the  decision  tree  method  led  us 
to  apply,  as  an  after  thought,  the  severity  calculation  procedure  (cf. 
p.  102)  applied  to  misclassifications,  to  the  ties  themselves.  In  other 
words,  perhaps  the  decision  tree's  tendency  to  ties  is  not,  in  its  effects, 
much  of  a problem.  Table  38  presents  the  results  of  this  further  analysis. 
They  reveal  that  the  problem  is  not  appreciably  different  from  that  of 
misclassification.  A large  majority  represent  instances  of  an  absence  of 
positive  change.  However,  again  as  with  decision  tree  misclassifications, 
approximately  one  case  in  four  or  five  would  result  in  destructive  change. 

In  part,  of  course,  the  ties  and  misclassification  problem  reflects 
the  version  of  a decision  tree  selected  for  implementation  in  the  present 
study.  In  theory,  we  might  alternatively  have  chosen  a sequenced  form, 
in  which  being  high  or.  the  first  index  in  the  sequence  excludes  some  number 
of  possibilities  altogether.  In  fact,  however,  this  possibility  se'.-ms  to 
be  excluded  by  the  state  of  the  knowledge  base.  While  there  are  sets  of 
variables  known  on  the  basis  of  research  to  be  causal  in  relation  to  other 
variables,  the  sequence  is  not  fixed  nor  immutable.  There  is  instead  more 
nearly  a gestalt  flavor  to  the  pattern  of  variables,  with  some  "usually" 
or  "generally"  thought  to  precede  others.  Therefore,  something  like  the 
present  version  of  a decision  tree  seems  the  only  feasible  form,  and,  as 
we  have  seen,  its  accuracy  leaves  something  to  be  desired. 
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TABLE  38 

t CHANGE  DEGRADINGS  FROM  DECISION  TREE  TIES 
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68 
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Zero-One  Count 

The  three  methods  --  distance  function,  decision  tree,  and  discri- 
minant function  --  do  not  differ  appreciably  on  a zero-one  count  comparison. 
At  roughly  .67  standard  deviation  units,  approximately  half  of  the  indexes 
per  work  group  exceed  the  designated  value  of  the  assigned  type  by  that 
much  or  more.  Overall  accuracy  of  classification  notwithstanding,  what 
this  suggests  is  considerable  variation,  perhaps  idiosyncratic,  perhaps 
for  particular  indexes.  However,  the  average  number  of  indexes  differing 
significantly  from  the  designated  value  exceeds  only  slightly  that  which 
would  be  explained  by  random  chance  according  to  normal  distribution  theory. 

Severity  of  Mi sclassifi cation 

The  methods  differ  rather  substantially  in  the  consequences  of  their 
misclassification.  While  the  consequences  of  correct  classification  under 
either  the  discriminant  function  or  decision  tree  procedure  would  be  a 
positive  improvement  of  one  quarter  to  one  half  a scale  point,  and  the 
majority  of  misclassifications  under  either  would  result  simply  in  an 
absence  of  positive  change,  the  decision  tree  would  much  more  frequently 
produce  destructive  change  (outright  deterioration)  in  its  misclassifieds. 
Under  the  decision  tree  method,  one  misclassification  in  four  would  produce 
destructive  change,  whereas  only  one  in  nine  would  do  so  using  a discriminant 
function.  Either  method  would  appear  superior  to  change  agent  choice, 
however,  since  the  latter  produced  results  significantly  worse  than  chance. 
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Use  of  Reduced  Information  Sets 

Our  results  indicate  that  classification  accuracy  suffers  rather 
dramatically  when  respondent  sampling  occurs.  Even  a two-thirds  random 
sample,  which  would  generally  represent  a higher  proportion  than  is 
considered  where  sampling  is  proposed,  produces  a reduction  in  classifi- 
cation accuracy  that  is  unacceptable. 

However,  at  least  a portion  of  the  results  suggest  that  reduced  numbers 
of  measures  might  produce  equally  satisfactory  results.  Our  generation 
of  "super-indexes"  for  use  by  the  discriminant  function,  together  with  the 
fact  that  fewer  than  the  number  of  items  involved  in  each  would  presumably 
be  required  to  obtain  adequate  internal  consistency,  suggests  that  it  might 
be  possible  to  generate  a "short  form"  of  the  survey  which  would  satisfy  the 
diagnostic  classification  purpose.  It  should  be  noted,  however,  that  that 
might  not  be  compatible  with  feedback  and  development  purposes. 

Ease  of  Calculation 

Only  the  decision  tree  method  provides  promise  of  easy  calculation. 

Were  its  accuracy  and  severity  of  mlsclassi fication  problems  to  be  solved,, 
therefore,  it  might  present  significant  potential  for  being  transportable 
and  not  bound  to  a large  data  bank  and  computer  capability. 

Conclusions  from  the  Research 

We  conclude  from  this  rather  complex  research  project  that  the  typology 
which  undergrids  our  efforts  at  data  based  (computer  assisted)  diagnosis  is 
Indeed  durable.  Of  the  methods  proposed  for  examination,  two  seem  not  to 
be  likely  candidates  at  any  Immediate  time  for  on-line  use  in  real  world 
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settings.  A Bayesian  system,  to  be  at  all  feasible,  must  first  solve 
the  problems  of  dealing  with  interrelated  measures.  This  is  no  mean  feat, 
since  it  seems  highly  likely  that  measures  of  organizational  functioning 
are  intrinsically  inter-related,  for  a whole  host  of  systemic  reasons. 

Any  effort  to  solve  the  problem  by  orthogonal  factor  analytic  treatment  of 
measures,  for  example,  seems  likely  to  encounter  the  difficulty  that  such 
efforts  have  in  organizational  settings  in  the  past.  What  is  orthogonal 
in  one  setting  at  one  time  is  no  longer  so  in  other  settings  or  at  future 
times.  Thus  a Bayesian  approach  to  organizational  diagnosis  seems  improbable 
of  realization  without  further  research. 

A decision  tree  approach,  while  eminently  doable,  as  the  present 
research  has  shown,  seems  to  suffer  from  a mismatch  between  its  requirements 
and  the  reality  of  the  situation.  The  state  of  the  knowledge  base  does  not 
permit  a decision  tree  of  optimal  form  (i.e.,  a "sequenced"  one).  On  the 
other  hand,  the  fact  that  the  non-sequenced  form  (like  any  other  form)  makes 
yes-no  judgements  means  that  it  sacrifices  information  that  both  a distance 
and  a discriminant  function  use.  As  a result,  it  makes  errors  of  classi- 
fication that  are  both  too  frequent  and,  more  importantly,  too  costly  in 
their  likely  impact  upon  change  outcomes. 

Even  the  decision  tree  method,  however,  and  certainly  each  of  the  others 
examined,  would  yield  results  superior  to  change  agent  choice,  which  did 
significantly  worse  than  chance. 

Respondent  sampling  seems  not  to  be  a feasible  route  to  making  the 
survey-diagnostic  task  more  manageable,  since  even  relatively  large  samples 
produce  unacceptable  errors  of  accuracy.  Collapsing  of  measures  and  reduction 
of  the  number  of  items  does  seem  a potentially  feasible  route,  however, 
provided  that  the  task  of  the  measures  is  limited  to  diagnostic  classifications. 
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Perhaps  one  of  the  most  significant  findings  is  that  neither  of  the 
comparatively  accurate  methods  is  easy  of  calculation.  Only  the  decision 
tree  method  has  this  potential  quality  and  its  problems  of  accuracy  clearly 
outweigh  that  ease.  It  seems  likely,  therefore,  that  any  acceptable 
method  of  organizational  diagnosis  will  for  the  foreseeable  future  require 
the  amassing  of  a large  data  bank  and  the  availability  of  a significant 
computer  accessibility. 

Against  this  backgrop,  our  conclusion  is  that  research  should  be  pushed 
on  the  following  issues  of  organizational  diagnosis,  answers  to  all  of 
which  could  greatly  broaden  the  scope  of  what  is  possible  and  implementable. 

. Additional  exploration  of  the  typology,  including  any 
possible  modifications  of  the  definitional  values,  its 
general izabil ity  and  durability. 

. Enlarge  the  knowledge  base  with  respect  to  the  effects 
of  different  treatments  on  different  types  within  the 
typology. 

. Investigation  of  the  effect  of  level  on  work  group 
type.  One  question  which  Illustrates  this  issue  is 
whether  a Type  three,  level  one  is  the  same  as  a Type 
three,  level  four.  (Level  refers  to  level  of  the  work 
group  in  the  organization.) 

. How  the  accuracy  of  classification  Is  affected  by  the 
level  of  the  work  groups  being  classified. 

. The  effect  on  the  typology  of  converting  work  group 
scores  to  level  percentiles  prior  to  classification. 
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Additional  inquiry  into  the  decision  tree  type  of 
algorithm,  with  possible  usage  of  non-uniform 
intervals,  both  across  types  and  across  indexes. 
Investigation  into  other  forms  of  data  reduction 
with  respect  to  sampling.  Possibilities  might 
include  stratified  sampling  or  sampling  only  from 
work  groups  with  size  above  a, certain  level. 
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SUMMARY 

The  process  of  organizational  development  and  change  contains  two 
component  subprocesses,  diagnosis  and  therapeutic  intervention.  Although 
equally  crucial  to  the  success  of  any  developmental  effort,  diagnosis  takes 
prior  importance  because  it  occurs  earlier  in  the  developmental  stream. 

Thus,  an  important  part  of  the  consultant's  role  is  often  presumed  to  con- 
sist of  translating  a wide  variety  of  symptoms  into  a coherent  pattern 
that  permits  planning  and  carrying  out  appropriate  remedial  action. 

If  this  is  the  desirable  state  of  affairs,  it  is  scarcely  what  in 
fact  obtains  in  most  organizational  development  projects.  Reviews  and 
exchanges  in  the  literature,  plus  the  meager  formal  evidence,  suggest  that 
the  field  is  characterized  by  ad  hoc  problem-solving  and  by  efforts  to 
simply  justify  whatever  it  is  that  the  consultant  knows  how  to  do. 

If  progress  is  to  be  made  in  organizational  diagnostic  methodology, 
procedures  must  be  explored  and  tested  which  offer  promise  of  systematizing 
the  task.  In  other  fields  with  a similar  diagnostic  task,  such  as  medicine 
and  psychiatry,  advances  have  been  registered  by  computerizing  portions 
of  the  process.  In  addition  to  speeding  up  the  procedure,  this  offers 
promise  of  handling  as  well  the  general  problem  of  the  clinical  judge 
(consultant):  statistical  prediction,  here  as  elsewhere,  is  likely  to  prove 
far  more  accurate  than  clinical,  or  clinically  mediated,  prediction. 

An  attempt  to  develop  and  test  more  rigorous  diagnostic  procedures  in 
organizational  development  should  be  based  upon  a model  containing  principles 
of  organizational  functioning.  The  Likert  meta-theoretical  statement 
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formed  the  model  underlying  the  research  herein  reported,  as  well  as  the 
source  of  content  of  the  Survey  of  Organizations1  data  bank,  containing 
approximately  7,000  work  groups,  used  to  test  the  propositions  which  were 
investi gated. 

An  earlier  study  had  used  a hierarchical  grouping  technique  which 
employs  the  distance  function  as  a measure  of  work  group  Survey  of 
Organizations'  profile  similarily  to  generate  a 17-type  typology.  The  type 
profile  means  from  this  earlier  study  were  taken  as  the  patterns  of  symptoms 
defining  the  17  alternative  functional  states,  and  the  distance  function 
was  used  with  all  7,000  groups  to  define  the  "true"  type  of  each  group. 
Against  this  criterion,  a number  of  questions  were  addressed: 

. Is  there  evidence  that  the  original  typology  holds  up 
when  applied  to  the  entire  population  of  groups;  that 
is,  does  it  appear  to  reflect  reality? 

. If  the  typology  holds  up,  how  effective  is  a distance 
function  in  classifying  by  type? 

. Using  the  distance  function  classification  as  the 
criterion,  how  accurate  are  discriminant  function, 
decision  tree,  and  Bayesian  methods  in  classifying 
groups? 

. To  what  extent  are  these  methods  able  to  classify 
groups  correctly  using  reduced  information  sets? 

. Do  the  methods  differ  in  ease  of  calculation? 

The  results  indicated  that  the  typology  did  indeed  hold  up,  with  within- 
cluster  variances  for  all  methods  used  to  reproduce  it  which  no  larger 
than,  and  ordinarily  smaller  than,  those  of  the  original  study.  As  might 
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be  expected,  the  distance  function  produced  profile  mean  scores  which  were 
closer  to  those  of  the  original  study  than  were  those  of  the  other  method, 
examined. 

The  3ayesian  method  met  with  an  early  dead-end,  since  the  survey  indexes 
were  both  intrinsically  and  in  fact  interrelated.  An  attempt  to  calculate 
odds-1 ikel ihood  ratios  despite  this  fact  produced  values  which  exceeded 
the  capacity  of  what  is  a very  large  computer.  While  this  method  ought 
not  be  permanently  dismissed,  further  progress  is  obviously  contingent  upon 
some  answer  being  found  to  the  problem  of  index  interdependence. 

On  the  surface,  the  decision  tree  and  discriminant  function  methods 
appeared  to  be  approximately  equal  in  their  ability  to  classify  groups 
correctly.  However,  the  decision  tree  method,  at  least  as  implemented 
within  this  study,  seemed  prone  to  an  inability  to  differentiate  among 
types  with  some  characteristics  in  common.  Specifically,  it  generated 
an  excessive  number  of  ties.  For  both  these  ties  and  its  outright  misclassi- 
fications,  there  was  a degrading  of  projected  change  which  was  quite  severe: 
one  out  of  four  misclassified  groups  would  experience  outright  destructive 
consequences,  whereas  only  one  in  nine  would  do  so  using  a discriminant 
function  method.  Any  of  the  methods  was  superior  to  consultant  choice, 
however,  where  results  were  significantly  worse  than  chance. 

Reduced  information  sets  in  the  form  of  fewer  measures  seem  entirely 
plausible  from  the  study's  results.  On  the  other  hand,  sampling  of  respondents 
produced  unacceptable  decrements  in  classification  accuracy  even  at  high 
percentages. 

Both  the  distance  and  discriminant  function  methods  require  large 
developmental  data  banks  and  a significant  computer  capacity  for  their 
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implementation.  Only  the  decision  tree  method  could  stand  alone  without 
these  supports.  However,  its  problems  of  accuracy  and  of  the  consequences 
of  its  errors  make  that  transportability  a dubious  advantage  at  present. 
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